BeClaude
Research | 2026-05-01

Instruction Complexity Induces Positional Collapse in Adversarial LLM Evaluation

Source: arXiv cs.AI

arXiv:2604.27249v1 Announce Type: cross

Abstract: When instructed to underperform on multiple-choice evaluations, do language models engage with question content or fall back on positional shortcuts? We map the boundary between these regimes using a six-condition adversarial instruction-specificity...

arxiv, papers