BeClaude
Research2026-05-14

Formal Conjectures: An Open and Evolving Benchmark for Verified Discovery in Mathematics

Source: Arxiv CS.AI

arXiv:2605.13171v1 Announce Type: new Abstract: As automated reasoning systems advance rapidly, there is a growing need for research-level formal mathematical problems to accurately evaluate their capabilities. To address this, we present Formal Conjectures, an evolving benchmark of currently 2615...

arxivpapersbenchmark