Research2026-04-30

Student Guides Teacher: Weak-to-Strong Inference via Spectral Orthogonal Exploration

arXiv:2601.06160v2 Announce Type: replace Abstract: Large Language Models (LLMs) often suffer from ''Reasoning Collapse'' on challenging mathematical reasoning tasks, where stochastic sampling produces lexical variations of the same erroneous logic rather than genuine semantic exploration. We...

Read Original Article on Arxiv CS.AI

arxivpapers