Research 2026-04-28
Leveraging Human Feedback for Semantically-Relevant Skill Discovery
Source: arXiv cs.AI
arXiv:2604.24127v1 (Announce Type: cross)
Abstract: Unsupervised skill discovery in reinforcement learning aims to intrinsically motivate agents to discover diverse and useful behaviours. However, unconstrained approaches can produce unsafe, unethical, or misaligned behaviours. To mitigate these...