Research2026-05-12

EcoGym: Evaluating LLMs for Long-Horizon Plan-and-Execute in Interactive Economies

arXiv:2602.09514v3 Announce Type: replace-cross Abstract: Long-horizon planning is widely recognized as a core capability of autonomous LLM-based agents; however, current evaluation frameworks suffer from being largely episodic, domain-specific, or insufficiently grounded in persistent economic...

Read Original Article on Arxiv CS.AI

arxivpapers