Best Open Source Speech Libraries
A curated list of the most popular GitHub repositories tagged with speech. Select any project to visualize its architecture and dive into the codebase using RepoMind's AI engine.
#1 coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
#2 babysor/MockingBird
🚀 Clone a voice in 5 seconds to generate arbitrary speech in real-time
#3 svc-develop-team/so-vits-svc
SoftVC VITS Singing Voice Conversion
#4 huggingface/datasets
🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools
#5 m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
#6 modelscope/modelscope
ModelScope: bring the notion of Model-as-a-Service to life.
#7 PaddlePaddle/models
Officially maintained, supported by PaddlePaddle, including CV, NLP, Speech, Rec, TS, big models and so on.
#8 TalAter/annyang
💬 Speech recognition for your site