BeClaude
Research · 2026-05-12

Where Do Reasoning Models Refuse?

Source: arXiv cs.AI

arXiv:2507.03167v3

Abstract: Chat models without chain-of-thought (CoT) reasoning must decide whether to refuse a harmful request before generating their first response token. Reasoning models, by contrast, produce extended chains of thought before their final output,...

arxiv, papers, reasoning