BeClaude
Research2026-05-12

Memorize Theorems, Not Instances: Probing SFT Generalization through Mathematical Reasoning

Source: Arxiv CS.AI

arXiv:2605.09270v1 Announce Type: cross Abstract: Supervised Fine-Tuning (SFT) is widely used for task-specific adaptation, yet recent work shows it systematically undermines reasoning generalization. We argue the root cause is not memorization itself, but its target: vanilla SFT drives models to...

arxivpapersreasoning