Research2026-05-01
Instruction Complexity Induces Positional Collapse in Adversarial LLM Evaluation
Source: Arxiv CS.AI
arXiv:2604.27249v1 Announce Type: cross Abstract: When instructed to underperform on multiple-choice evaluations, do language models engage with question content or fall back on positional shortcuts? We map the boundary between these regimes using a six-condition adversarial instruction-specificity...
arxivpapers