BeClaude
Research, 2026-04-23

Measuring the Machine: Evaluating Generative AI as Pluralist Sociotechnical Systems

Source: arXiv cs.AI

arXiv:2604.20545v1

Abstract: In measurement theory, instruments do not simply record reality; they help constitute what is observed. The same holds for generative AI evaluation: benchmarks do not just measure, they shape what models appear to be. Functionalist benchmarks treat...

arxiv, papers