Research2026-05-12
PoDAR: Power-Disentangled Audio Representation for Generative Modeling
Source: Arxiv CS.AI
arXiv:2605.10084v1 Announce Type: cross Abstract: The performance of audio latent diffusion models is primarily governed by generator expressivity and the modelability of the underlying latent space. While recent research has focused primarily on the former, as well as improving the reconstruction...
arxivpapers