Best Open Source text to speech Libraries

A curated list of the most popular GitHub repositories tagged with text to speech. Select any project to visualize its architecture and dive into the codebase using RepoMind's AI engine.

#1RVC-Boss/GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

55,156Python

Analyze Code

#2unslothai/unsloth

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.

52,562Python

Analyze Code

#3coqui-ai/TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

44,580Python

Analyze Code

#42noise/ChatTTS

A generative speech model for daily dialogue.

38,740Python

Analyze Code

#5babysor/MockingBird

🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time

36,875Python

Analyze Code

#6myshell-ai/OpenVoice

Instant voice cloning by MIT and MyShell. Audio foundation model.

35,985Python

Analyze Code

#7FunAudioLLM/CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

19,659Python

Analyze Code

#8nari-labs/dia

A TTS model capable of generating ultra-realistic dialogue in one pass.

19,120Python

Analyze Code

#9index-tts/index-tts

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

18,867Python

Analyze Code

#10espnet/espnet

End-to-End Speech Processing Toolkit

9,742Python

Analyze Code

#11open-mmlab/Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

9,694Python

Analyze Code