Best Open Source text to speech Libraries
A curated list of the most popular GitHub repositories tagged with text to speech. Select any project to visualize its architecture and dive into the codebase using RepoMind's AI engine.
#1RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
#2unslothai/unsloth
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.
#3coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
#42noise/ChatTTS
A generative speech model for daily dialogue.
#5babysor/MockingBird
🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time
#6myshell-ai/OpenVoice
Instant voice cloning by MIT and MyShell. Audio foundation model.
#7FunAudioLLM/CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
#8nari-labs/dia
A TTS model capable of generating ultra-realistic dialogue in one pass.
#9index-tts/index-tts
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
#10espnet/espnet
End-to-End Speech Processing Toolkit
#11open-mmlab/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.