BeClaude
Research2026-05-08

Learning Discrete Autoregressive Priors with Wasserstein Gradient Flow

Source: Arxiv CS.AI

arXiv:2605.06148v1 Announce Type: cross Abstract: Discrete image tokenizers are commonly trained in two stages: first for reconstruction, and then with a prior model fitted to the frozen token sequences. This decoupling leaves the tokenizer unaware of the model that will later generate its tokens....

arxivpapers