Research2026-05-06
A Sentence Relation-Based Approach to Sanitizing Malicious Instructions
Source: Arxiv CS.AI
arXiv:2605.01078v1 Announce Type: cross Abstract: Retrieval-augmented generation and tool-integrated LLM agents increasingly depend on external textual sources. This reliance broadens the available attack surface, allowing adversaries to insert malicious instructions that trigger unintended model...
arxivpapers