Research2026-05-12
EcoGym: Evaluating LLMs for Long-Horizon Plan-and-Execute in Interactive Economies
Source: Arxiv CS.AI
arXiv:2602.09514v3 Announce Type: replace-cross Abstract: Long-horizon planning is widely recognized as a core capability of autonomous LLM-based agents; however, current evaluation frameworks suffer from being largely episodic, domain-specific, or insufficiently grounded in persistent economic...
arxivpapers