Research2026-04-30

When to Vote, When to Rewrite: Disagreement-Guided Strategy Routing for Test-Time Scaling

arXiv:2604.26644v1 Announce Type: new Abstract: Large Reasoning Models (LRMs) achieve strong performance on mathematical reasoning tasks but remain unreliable on challenging instances. Existing test-time scaling methods, such as repeated sampling, self-correction, and tree search, improve...

Read Original Article on Arxiv CS.AI

arxivpapers