Research 2026-04-24
Stealthy Backdoor Attacks against LLMs Based on Natural Style Triggers
Source: arXiv cs.AI
arXiv:2604.21700v1 (announce type: cross). Abstract: The growing application of large language models (LLMs) in safety-critical domains has raised urgent concerns about their security. Many recent studies have demonstrated the feasibility of backdoor attacks against LLMs. However, existing methods...