Research2026-05-11
Randomness is sometimes necessary for coordination
Source: Arxiv CS.AI
arXiv:2605.06825v1 Announce Type: new Abstract: Full parameter sharing is standard in cooperative multi-agent reinforcement learning (MARL) for homogeneous agents. Under permutation-symmetric observations, however, a shared deterministic policy outputs identical action distributions for every...
arxivpapers