Research · 2026-04-17
Pruning Long Chain-of-Thought of Large Reasoning Models via Small-Scale Preference Optimization
Source: arXiv cs.AI
arXiv:2508.10164v2
Abstract: Recent advances in Large Reasoning Models (LRMs) have demonstrated strong performance on complex tasks through long Chain-of-Thought (CoT) reasoning. However, their lengthy outputs increase computational costs and may lead to overthinking, raising...
arxiv, papers, reasoning