Research2026-05-06
AcademiClaw: When Students Set Challenges for AI Agents
Source: Arxiv CS.AI
arXiv:2605.02661v1 Announce Type: new Abstract: Benchmarks within the OpenClaw ecosystem have thus far evaluated exclusively assistant-level tasks, leaving the academic-level capabilities of OpenClaw largely unexamined. We introduce AcademiClaw, a bilingual benchmark of 80 complex, long-horizon...
arxivpapersagents