Research 2026-05-12
EnactToM: An Evolving Benchmark for Functional Theory of Mind in Embodied Agents
Source: arXiv cs.AI
arXiv:2605.09826v1 Announce Type: new Abstract: Theory of Mind (ToM), the ability to track others' epistemic state, makes humans efficient collaborators. AI agents need the same capacity in multi-agent settings, yet existing benchmarks mostly test literal ToM by asking direct belief questions. The...
arxiv papers agents benchmark