Research2026-04-23
ATIR: Towards Audio-Text Interleaved Contextual Retrieval
Source: Arxiv CS.AI
arXiv:2604.20267v1 Announce Type: cross Abstract: Audio carries richer information than text, including emotion, speaker traits, and environmental context, while also enabling lower-latency processing compared to speech-to-text pipelines. However, recent multimodal information retrieval research...
arxivpapers