Research2026-04-24

Stealthy Backdoor Attacks against LLMs Based on Natural Style Triggers

arXiv:2604.21700v1 Announce Type: cross Abstract: The growing application of large language models (LLMs) in safety-critical domains has raised urgent concerns about their security. Many recent studies have demonstrated the feasibility of backdoor attacks against LLMs. However, existing methods...

Read Original Article on Arxiv CS.AI

arxivpapers