Research2026-05-14
Systematic Failures in Collective Reasoning under Distributed Information in Multi-Agent LLMs
Source: Arxiv CS.AI
arXiv:2505.11556v4 Announce Type: replace-cross Abstract: Multi-agent systems built on large language models (LLMs) are expected to enhance decision-making by pooling distributed information, yet systematically evaluating this capability has remained challenging. We introduce HiddenBench, a 65-task...
arxivpapersreasoningagents