OpenBMB has released VoxCPM 2, a cutting-edge text-to-speech (TTS) model featuring 2 billion parameters and designed for multilingual voice synthesis across 30 languages. This model advances the previous VoxCPM releases by enhancing long-text stability and supporting zero-shot voice creation without a reference audio, making it suitable for professional applications such as filmmaking and gaming. VoxCPM 2 is capable of producing high-quality, expressive speech with features like controllable voice cloning and a unique voice design that utilizes natural-language descriptions to craft new voices. This evolution is part of a broader trend in TTS technology aiming to provide seamless and realistic multilingual capabilities.
OpenBMB: OpenBMB, or Open Lab for Big Model Base, is an AI research organization dedicated to developing open-source foundation models and systems advancing towards artificial general intelligence. The lab produces projects like ChatDev and multimodal models such as MiniCPM. In this news, OpenBMB launched VoxCPM 2, their latest tokenizer-free TTS model designed for production-grade multilingual speech synthesis with advanced voice cloning features.
VoxCPM 2: VoxCPM 2 is an open-source text-to-speech model employing a tokenizer-free diffusion autoregressive architecture built on a MiniCPM backbone for generating natural, expressive speech in continuous latent space. It enables zero-shot voice design from natural language descriptions, controllable voice cloning from short audio clips, and high-fidelity continuation cloning while producing studio-quality audio. The release introduces expanded multilingual capabilities including Chinese dialects and Southeast Asian languages, positioning it for professional applications in content creation and voice agents.
Apache-2.0: Apache-2.0 is a permissive open-source software license that allows free use, modification, and distribution for both commercial and non-commercial purposes with minimal restrictions. It requires preservation of copyright notices and disclaimers in distributions. In the news, VoxCPM 2 is released under the Apache-2.0 license, enabling broad commercial deployment.
Model Evolution: Evolves from initial VoxCPM releases in late 2025 with improvements in long-text stability, fine-tuning support, and now unified multilingual voice generation.
Community Integration: Rapidly adopted in tools like ComfyUI for node-based voice cloning and LoRA training workflows.
Benchmark Competitiveness: Delivers state-of-the-art results on zero-shot TTS evaluations like Seed-TTS-eval and multilingual tests against models from CosyVoice and Qwen.
Sources
- https://www.linkedin.com/posts/charlywargnier_the-new-era-of-open-source-tts-is-here-activity-7447055492704538626-JbOO
- https://x.com/i/status/2043520939988811820
- https://github.com/OpenBMB/VoxCPM
- https://huggingface.co/openbmb/VoxCPM2/discussions
- https://x.com/i/status/2043521442206388432
- https://trendshift.io/repositories/17704
- https://x.com/i/status/2042428184201998764
- https://x.com/i/status/2043297327868227609
- https://www.reddit.com/r/LocalLLaMA/comments/1sg89kl/new_tts_model_voxcpm2
- https://x.com/i/status/2042064861971267879
- https://huggingface.co/openbmb/VoxCPM2
- https://www.youtube.com/watch?v=jy1mQFIrqXo
- https://github.com/OpenBMB
- https://x.com/i/status/2042925603217670418
- https://www.instagram.com/p/DW9Jiq8gQSW
- https://x.com/i/status/2041192067947241893
- https://x.com/i/status/2042412688874389542
- https://medium.com/@openbmb
- https://huggingface.co/openbmb
- https://x.com/i/status/2043513717426708630
- https://openbmb.github.io/IoA
- https://x.com/i/status/2042429142780768643
- https://github.com/OpenBMB/ChatDev/issues/588
- https://x.com/OpenBMB
- https://x.com/_akhaliq/status/2041199432654278829
- https://x.com/i/status/2040055394958286903
- https://huggingface.co/openbmb/models
- https://x.com/i/status/2041863847498199127
- https://x.com/i/status/2034435998692647201
- https://x.com/i/status/2041192329391083828
- https://chatdev.ai/
- https://x.com/i/status/2041169065020936464
- https://www.raiaai.com/blogs/raia-agent-platform-vs-openbmb-transforming-ai-development-through-team-collaboration
- https://www.youtube.com/watch?v=VD0TuPfsgfA
- https://x.com/i/status/2043456171869081752
- https://voxcpm.net/
- https://x.com/i/status/2043167839079506023
