Cantonese Speech Corpus Breakthrough: How WenetSpeech-Yue’s 21K Hours Transform AI

13 hours ago 高效码农

WenetSpeech-Yue: A Large-Scale Cantonese Speech Corpus with Multi-Dimensional Annotation Why Cantonese Speech Processing Demands Large-Scale Annotated Resources Cantonese, spoken by approximately 84.9 million native speakers worldwide, presents unique challenges for speech processing due to its rich tone system of nine tones in six categories, coexistence of literary and colloquial forms, and frequent code-switching with English. Despite its linguistic complexity and cultural significance, Cantonese has remained severely under-resourced in speech technology compared to major languages. The development of WenetSpeech-Yue addresses this critical gap by providing the largest open-source Cantonese speech corpus with comprehensive multi-dimensional annotations. The WenetSpeech-Pipe Framework: Building High-Quality Speech …