BeClaude
Research2026-05-12

WorldSpeech: A Multilingual Speech Corpus from Around the World

Source: Arxiv CS.AI

arXiv:2605.09167v1 Announce Type: cross Abstract: Automatic speech recognition (ASR) performs well for high-resource languages with abundant paired audio-transcript data, but its accuracy degrades sharply for most languages due to limited publicly available aligned data. To this end, we introduce...

arxivpapers