Research2026-05-12
Memorize Theorems, Not Instances: Probing SFT Generalization through Mathematical Reasoning
Source: Arxiv CS.AI
arXiv:2605.09270v1 Announce Type: cross Abstract: Supervised Fine-Tuning (SFT) is widely used for task-specific adaptation, yet recent work shows it systematically undermines reasoning generalization. We argue the root cause is not memorization itself, but its target: vanilla SFT drives models to...
arxivpapersreasoning