Research2026-05-12
Quantile Geometry Regularization for Distributional Reinforcement Learning
Source: Arxiv CS.AI
arXiv:2605.08182v1 Announce Type: cross Abstract: Quantile-based distributional reinforcement learning methods learn return distributions through sampled quantile regression, but their bootstrapped target quantiles may induce distorted or degenerate distribution estimates. We propose Robust...
arxivpapersrl