Research 2026-04-28
DLM: Unified Decision Language Models for Offline Multi-Agent Sequential Decision Making
Source: arXiv cs.AI
arXiv:2604.23557v1 (announce type: cross)
Abstract: Building scalable and reusable multi-agent decision policies from offline datasets remains a challenge in offline multi-agent reinforcement learning (MARL), as existing methods often rely on fixed observation formats and action spaces that limit...
arxiv, papers, agents