Research · 2025-04-02
PaperBench: Evaluating AI’s Ability to Replicate AI Research
Source: OpenAI
We introduce PaperBench, a benchmark evaluating the ability of AI agents to replicate state-of-the-art AI research.