Research2026-05-12
SciIntegrity-Bench: A Benchmark for Evaluating Academic Integrity in AI Scientist Systems
Source: Arxiv CS.AI
arXiv:2605.10246v1 Announce Type: new Abstract: AI scientist systems are increasingly deployed for autonomous research, yet their academic integrity has never been systematically evaluated. We introduce SCIINTEGRITY-BENCH, the first benchmark designed around a dilemmatic evaluation paradigm: each...
arxivpapersbenchmark