Research2026-04-30
Student Guides Teacher: Weak-to-Strong Inference via Spectral Orthogonal Exploration
Source: Arxiv CS.AI
arXiv:2601.06160v2 Announce Type: replace Abstract: Large Language Models (LLMs) often suffer from ''Reasoning Collapse'' on challenging mathematical reasoning tasks, where stochastic sampling produces lexical variations of the same erroneous logic rather than genuine semantic exploration. We...
arxivpapers