BeClaude
Research2026-05-12

Done, But Not Sure: Disentangling World Completion from Self-Termination in Embodied Agents

Source: Arxiv CS.AI

arXiv:2605.08747v1 Announce Type: new Abstract: Standard embodied evaluations do not independently score whether an agent correctly commits to task completion at episode closure, a capacity we call terminal commitment. Behaviorally distinct failures--never completing the task, completing it but...

arxivpapersagents