BeClaude
Research2026-05-12

EnactToM: An Evolving Benchmark for Functional Theory of Mind in Embodied Agents

Source: Arxiv CS.AI

arXiv:2605.09826v1 Announce Type: new Abstract: Theory of Mind (ToM), the ability to track others epistemic state, makes humans efficient collaborators. AI agents need the same capacity in multi agent settings, yet existing benchmarks mostly test literal ToM by asking direct belief questions. The...

arxivpapersagentsbenchmark