BeClaude
Back to News
Research2026-04-17

The Signal is in the Steps: Local Scoring for Reasoning Data Selection

Source: Arxiv CS.AI

arXiv:2510.03988v2 Announce Type: replace-cross Abstract: Distilling long-form reasoning from teacher models into smaller students requires selecting which candidate solutions to train on. Recent work argues that one should select responses the student model assigns highest probability, i.e.,...

arxivpapersreasoning