Research2026-04-23
Measuring the Machine: Evaluating Generative AI as Pluralist Sociotechical Systems
Source: Arxiv CS.AI
arXiv:2604.20545v1 Announce Type: new Abstract: In measurement theory, instruments do not simply record reality; they help constitute what is observed. The same holds for generative AI evaluation: benchmarks do not just measure, they shape what models appear to be. Functionalist benchmarks treat...
arxivpapers