Research2026-05-14

Filter-then-Weight: Online Data Selection and Reweighting for LLM Fine-Tuning

arXiv:2604.00001v2 Announce Type: replace-cross Abstract: Gradient-based data selection offers a principled framework for estimating sample utility in large language model (LLM) fine-tuning, but existing methods are mostly designed for offline settings. They are therefore less suited to online...

Read Original Article on Arxiv CS.AI

arxivpapersfine-tuning