Research | 2026-05-06
Leveraging Data Symmetries to Select an Optimal Subset of Training Data under Label Noise
Source: arXiv cs.AI
arXiv:2605.01874v1 (announce type: cross)
Abstract: The performance of machine learning models often relies on large labeled datasets; however, data collected from diverse sources can contain label noise. Recent work has shown that, in noisy settings, there may exist a subset of the training data on...
arxiv, papers, rag