Best Open Source Speech Recognition Libraries

A curated list of the most popular GitHub repositories tagged with speech recognition. Select any project to visualize its architecture and dive into the codebase using RepoMind's AI engine.

#1 huggingface/transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

156,780 stars · Python

#2 ggml-org/whisper.cpp

Port of OpenAI's Whisper model in C/C++

46,889 stars · C++

#3 mozilla/DeepSpeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

26,724 stars · C++

#4 SYSTRAN/faster-whisper

Faster Whisper transcription with CTranslate2

21,060 stars · Python

#5 m-bain/whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

20,249 stars · Python

#6 modelscope/FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

14,927 stars · Python

#7 NVIDIA/DeepLearningExamples

State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

14,732 stars · Jupyter Notebook

#8 alphacep/vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

14,256 stars · Jupyter Notebook

#9 kmario23/deep-learning-drizzle

Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!

12,796 stars · HTML

#10 PaddlePaddle/PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

12,533 stars · Python

#11 espnet/espnet

End-to-End Speech Processing Toolkit

9,742 stars · Python

#12 openvinotoolkit/openvino

OpenVINO™ is an open source toolkit for optimizing and deploying AI inference

9,732 stars · C++

#13 Uberi/speech_recognition

Speech recognition module for Python, supporting several engines and APIs, online and offline.

8,952 stars · Python

#14 TalAter/annyang

💬 Speech recognition for your site

6,672 stars · JavaScript
