Research2026-05-06
SCARV: Structure-Constrained Aggregation for Stable Sample Ranking in Redundant NLP Datasets
Source: Arxiv CS.AI
arXiv:2605.00944v1 Announce Type: cross Abstract: Sample-level rankings are increasingly used in data-centric NLP for analysis, filtering, debugging, and curation, yet existing pipelines typically score training examples pointwise and rank them as if they were independent. This assumption is...
arxivpapers