Research2026-05-12
Step Rejection Fine-Tuning: A Practical Distillation Recipe
Source: Arxiv CS.AI
arXiv:2605.10674v1 Announce Type: cross Abstract: Rejection Fine-Tuning (RFT) is a standard method for training LLM agents, where unsuccessful trajectories are discarded from the training set. In the context of SWE-bench tasks, this corresponds to filtering out runs where the submitted patch does...
arxivpapersfine-tuning