Research2026-05-12
Where Do Reasoning Models Refuse?
Source: Arxiv CS.AI
arXiv:2507.03167v3 Announce Type: replace-cross Abstract: Chat models without chain-of-thought (CoT) reasoning must decide whether to refuse a harmful request before generating their first response token. Reasoning models, by contrast, produce extended chains of thought before their final output,...
arxivpapersreasoning