Research 2026-04-28

RouteGuard: Internal-Signal Detection of Skill Poisoning in LLM Agents

arXiv:2604.22888v1. Announce type: cross.

Abstract: Agent skills introduce a new and more severe form of indirect injection for LLM agents: unlike traditional indirect prompt injection, attackers can hide malicious instructions inside a dense, action-oriented skill that already functions as a...

Read Original Article on Arxiv CS.AI

arxiv, papers, agents