back to home

Best Open Source text to speech Libraries

A curated list of the most popular GitHub repositories tagged with text to speech. Select any project to visualize its architecture and dive into the codebase using RepoMind's AI engine.

#1RVC-Boss/GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

55,156Python
Analyze Code

#2unslothai/unsloth

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.

52,562Python
Analyze Code

#3coqui-ai/TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

44,580Python
Analyze Code

#42noise/ChatTTS

A generative speech model for daily dialogue.

38,740Python
Analyze Code

#5babysor/MockingBird

🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time

36,875Python
Analyze Code

#6myshell-ai/OpenVoice

Instant voice cloning by MIT and MyShell. Audio foundation model.

35,985Python
Analyze Code

#7FunAudioLLM/CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

19,659Python
Analyze Code

#8nari-labs/dia

A TTS model capable of generating ultra-realistic dialogue in one pass.

19,120Python
Analyze Code

#9index-tts/index-tts

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

18,867Python
Analyze Code

#10espnet/espnet

End-to-End Speech Processing Toolkit

9,742Python
Analyze Code

#11open-mmlab/Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

9,694Python
Analyze Code