Research2026-05-08
BehaviorGuard: Online Backdoor Defense for Deep Reinforcement Learning
Source: Arxiv CS.AI
arXiv:2605.05977v1 Announce Type: new Abstract: Backdoor attacks pose a serious threat to deep reinforcement learning (DRL). Current defenses typically rely on reward anomalies to reverse-engineer triggers and model finetuning to remove backdoors. However, complex trigger patterns undermine their...
arxivpapersrl