Research2026-05-12

SciIntegrity-Bench: A Benchmark for Evaluating Academic Integrity in AI Scientist Systems

arXiv:2605.10246v1 Announce Type: new Abstract: AI scientist systems are increasingly deployed for autonomous research, yet their academic integrity has never been systematically evaluated. We introduce SCIINTEGRITY-BENCH, the first benchmark designed around a dilemmatic evaluation paradigm: each...

Read Original Article on Arxiv CS.AI

arxivpapersbenchmark