BeClaude
Research2026-05-12

Quantile Geometry Regularization for Distributional Reinforcement Learning

Source: Arxiv CS.AI

arXiv:2605.08182v1 Announce Type: cross Abstract: Quantile-based distributional reinforcement learning methods learn return distributions through sampled quantile regression, but their bootstrapped target quantiles may induce distorted or degenerate distribution estimates. We propose Robust...

arxivpapersrl