BeClaude
Research2026-04-23

What Makes a Good AI Review? Concern-Level Diagnostics for AI Peer Review

Source: Arxiv CS.AI

arXiv:2604.19998v1 Announce Type: new Abstract: Evaluating AI-generated reviews by verdict agreement is widely recognized as insufficient, yet current alternatives rarely audit which concerns a system identifies, how it prioritizes them, or whether those priorities align with the review rationale...

arxivpapers