哔哩哔哩Index团队近日宣布,其自主研发的自回归零样本文本转语音系统IndexTTS-2.0已全面开源。该系统被视为零样本TTS技术迈向实用化的关键里程碑,通过时间编码机制和音色与情感解耦建模两项核心创新,解决了语音合成领域中时长控制与情感表达的技术难题。IndexTTS-2.0在语音生成方面表现出极高的灵活性,可广泛应用于AI配音、有声读物、动态漫画、视频翻译、语音对话及播客制作等多种场景,为全球内容出海提供了重要技术支撑,降低了优质内容跨语言传播的门槛。目前,IndexTTS-2.0已同步开源项目论文、完整代码、模型权重及在线体验页面。IndexTTS团队表示,未来将持续优化模型性能,并逐步释放更多资源与工具,与开发者社区共同构建开放、繁荣的语音技术生态。GitHub地址:GitHub - index-tts/index-tts: An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System。论文地址:[2506.21619] IndexTTS2: A Breakthrough in Emotionally Expressive and Duration-Controlled Auto-Regressive Zero-Shot Text-to-Speech。Demo展示地址:IndexTTS2: A Breakthrough in Emotionally Expressive and Duration-Controlled Auto-Regressive Zero-Shot Text-to-Speech。模型下载地址:魔搭社区/IndexTTS-2、Hugging Face/ IndexTTS-2。在线体验地址:https://huggingface.co/spaces/IndexTeam/IndexTTS-2-Demo。