BeClaude
Research2026-05-12

The Last Word Often Wins: A Format Confound in Chain-of-Thought Corruption Studies

Source: Arxiv CS.AI

arXiv:2605.10799v1 Announce Type: cross Abstract: Corruption studies, the primary tool for evaluating chain-of-thought (CoT) faithfulness, identify which chain positions are "computationally important" by measuring accuracy when steps are replaced with errors. We identify a systematic confound: for...

arxivpapers