BeClaude
Research | 2026-05-12

Is Data Shapley Not Better than Random in Data Selection? Ask NASH

Source: arXiv cs.AI

arXiv:2605.10684v1 (announce type: cross)

Abstract: Data selection studies the problem of identifying high-quality subsets of training data. While some existing works have considered selecting the subset of data with top-$m$ Data Shapley or other semivalues as they account for the interaction among...
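To make the "top-$m$ Data Shapley" selection rule mentioned in the abstract concrete, here is a minimal sketch under stated assumptions. It is not the paper's method or its NASH procedure. The dataset `GROUPS`, the function `coverage_utility`, and the helper names `shapley_values` and `top_m` are all hypothetical and chosen only for illustration. The utility of a subset is assumed to be the number of distinct groups it covers, and Shapley values are computed exactly by averaging marginal gains over every ordering, which is only feasible for a handful of points.

```python
from fractions import Fraction
from itertools import permutations

# Hypothetical toy dataset: each training point belongs to one group.
# Points 0 and 1 are redundant with each other (both cover group "a").
GROUPS = ["a", "a", "b", "c"]


def coverage_utility(subset):
    """Assumed utility: number of distinct groups covered by the subset."""
    return len({GROUPS[i] for i in subset})


def shapley_values(n, utility):
    """Exact Shapley values: average marginal gain of each point over all orderings."""
    totals = [Fraction(0)] * n
    orderings = list(permutations(range(n)))
    for order in orderings:
        chosen = set()
        prev = utility(chosen)
        for i in order:
            chosen.add(i)
            cur = utility(chosen)
            totals[i] += cur - prev  # marginal contribution of point i
            prev = cur
    return [t / len(orderings) for t in totals]


def top_m(values, m):
    """Indices of the m largest values; ties are broken by lower index."""
    return sorted(range(len(values)), key=lambda i: (-values[i], i))[:m]


values = shapley_values(len(GROUPS), coverage_utility)
selected = top_m(values, 2)
```

In this toy setting the two redundant points split the credit for their shared group, so each receives half the value of a point that covers a group alone, and the top-2 rule picks the two non-redundant points. Real uses would replace the exhaustive enumeration with an approximation and the toy utility with a model's validation performance.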
