Research 2026-05-06
Principles and Guidelines for Randomized Controlled Trials in AI Evaluation
Source: arXiv cs.AI
arXiv:2605.02050v1 (Announce Type: cross)
Abstract: This work establishes a foundational framework for standardizing AI evaluation RCTs (sometimes called human uplift studies). Drawing on established experimental practices from disciplines with established RCT traditions, including software...