GLM-TTS: The New Open-Source Benchmark for Emotional Zero-Shot Chinese TTS

The core question most developers are asking in late 2025: is there finally a fully open-source TTS that can clone any voice from 3–10 seconds of audio, sound emotional, stream in real time, and handle Chinese polyphones accurately?

The answer is yes, and it launched today. On December 11, 2025, Zhipu AI open-sourced GLM-TTS: a production-ready, zero-shot, emotionally expressive text-to-speech system that is currently the strongest open-source Chinese TTS available.

Why GLM-TTS Changes Everything, in Four Bullet Points

Zero-shot voice cloning: 3–10 s reference audio is …