Research 2026-05-14
Filter-then-Weight: Online Data Selection and Reweighting for LLM Fine-Tuning
Source: arXiv cs.AI
arXiv:2604.00001v2 Announce Type: replace-cross
Abstract: Gradient-based data selection offers a principled framework for estimating sample utility in large language model (LLM) fine-tuning, but existing methods are mostly designed for offline settings. They are therefore less suited to online...
Tags: arxiv, papers, fine-tuning