Research2026-05-12
SynerDiff: Synergetic Continuous Batching for Fast and Parallel Diffusion Model Inference
Source: Arxiv CS.AI
arXiv:2605.08835v1 Announce Type: new Abstract: The expansion of Artificial Intelligence-generated content service requires diffusion model serving to simultaneously achieve high throughput and low task end-to-end (E2E) latency. However, existing continuous batching methods suffer from severe...
arxivpapersimage-generation