Best Open Source speech Libraries

A curated list of the most popular GitHub repositories tagged with speech. Select any project to visualize its architecture and dive into the codebase using RepoMind's AI engine.

#1coqui-ai/TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

44,580Python

Analyze Code

#2babysor/MockingBird

🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time

36,875Python

Analyze Code

#3svc-develop-team/so-vits-svc

SoftVC VITS Singing Voice Conversion

27,989Python

Analyze Code

#4huggingface/datasets

🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools

21,200Python

Analyze Code

#5m-bain/whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

20,249Python

Analyze Code

#6modelscope/modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

8,720Python

Analyze Code

#7PaddlePaddle/models

Officially maintained, supported by PaddlePaddle, including CV, NLP, Speech, Rec, TS, big models and so on.

6,948Python

Analyze Code

#8TalAter/annyang

💬 Speech recognition for your site

6,672JavaScript

Analyze Code