Research2026-04-28

Leveraging Human Feedback for Semantically-Relevant Skill Discovery

arXiv:2604.24127v1 Announce Type: cross Abstract: Unsupervised skill discovery in reinforcement learning aims to intrinsically motivate agents to discover diverse and useful behaviours. However, unconstrained approaches can produce unsafe, unethical, or misaligned behaviours. To mitigate these...

Read Original Article on Arxiv CS.AI

arxivpapersrag