BeClaude
Research2026-04-28

Leveraging Human Feedback for Semantically-Relevant Skill Discovery

Source: Arxiv CS.AI

arXiv:2604.24127v1 Announce Type: cross Abstract: Unsupervised skill discovery in reinforcement learning aims to intrinsically motivate agents to discover diverse and useful behaviours. However, unconstrained approaches can produce unsafe, unethical, or misaligned behaviours. To mitigate these...

arxivpapersrag