Research 2026-05-11
Agentick: A Unified Benchmark for General Sequential Decision-Making Agents
Source: arXiv cs.AI
arXiv:2605.06869v1
Announce Type: new
Abstract: AI agent research spans a wide spectrum, from RL agents that learn from scratch to foundation model agents that leverage pre-trained knowledge, yet no unified benchmark enables fair comparison across these approaches. We present Agentick, a benchmark...
arxiv papers agents benchmark