Research2026-05-12

General Agent Evaluation

arXiv:2602.22953v2 Announce Type: replace Abstract: General-purpose agents perform tasks in unfamiliar environments without domain-specific manual customization. Yet no study has systematically measured how agent architecture shapes performance across heterogeneous protocols and diverse unfamiliar...

Read Original Article on Arxiv CS.AI

arxivpapersagents