BeClaude
Research2026-04-24

Strategic Scaling of Test-Time Compute: A Bandit Learning Approach

Source: Arxiv CS.AI

arXiv:2506.12721v2 Announce Type: replace Abstract: Scaling test-time compute has emerged as an effective strategy for improving the performance of large language models. However, existing methods typically allocate compute uniformly across all queries, overlooking variation in query difficulty. To...

arxivpapers