Research2026-04-17

Pruning Long Chain-of-Thought of Large Reasoning Models via Small-Scale Preference Optimization

arXiv:2508.10164v2 Announce Type: replace Abstract: Recent advances in Large Reasoning Models (LRMs) have demonstrated strong performance on complex tasks through long Chain-of-Thought (CoT) reasoning. However, their lengthy outputs increase computational costs and may lead to overthinking, raising...

Read Original Article on Arxiv CS.AI

arxivpapersreasoning