Research | 2026-05-14

DAWM: Diffusion Action World Models for Offline Reinforcement Learning via Action-Inferred Transitions

Source: arXiv cs.AI

arXiv:2509.19538v2
Announce Type: replace-cross

Abstract: Diffusion-based world models have demonstrated strong capabilities in synthesizing realistic long-horizon trajectories for offline reinforcement learning (RL). However, many existing methods do not directly generate actions alongside states...

Tags: arxiv, papers, image-generation, rl