Partnership2026-04-28
Hearing to Translate: The Effectiveness of Speech Modality Integration into LLMs
Source: Arxiv CS.AI
arXiv:2512.16378v4 Announce Type: replace-cross Abstract: As Large Language Models (LLMs) expand beyond text, integrating speech as a native modality has given rise to SpeechLLMs, which directly process spoken language and enable speech-to-text translation (ST) and other downstream tasks, bypassing...
arxivpapers