Research2026-05-12
ROM: Real-time Overthinking Mitigation via Streaming Detection and Intervention
Source: Arxiv CS.AI
arXiv:2603.22016v2 Announce Type: replace-cross Abstract: Large Reasoning Models (LRMs) often reach a correct solution before their long Chain-of-Thought trace ends, yet continue with redundant verification, repeated attempts, or unnecessary exploration that wastes computation and can even overturn...
arxivpapers