Best Open Source speech to text Libraries
A curated list of the most popular GitHub repositories tagged with speech to text. Select any project to visualize its architecture and dive into the codebase using RepoMind's AI engine.
#1ggml-org/whisper.cpp
Port of OpenAI's Whisper model in C/C++
#2mozilla/DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
#3SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
#4m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
#5alphacep/vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
#6Zackriya-Solutions/meeting-minutes
Privacy first, AI meeting assistant with 4x faster Parakeet/Whisper live transcription, speaker diarization, and Ollama summarization built on Rust. 100% local processing. no cloud required. Meetily (Meetly Ai - https://meetily.ai) is the #1 Self-hosted, Open-source Ai meeting note taker for macOS & Windows.
#7KoljaB/RealtimeSTT
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
#8Uberi/speech_recognition
Speech recognition module for Python, supporting several engines and APIs, online and offline.
#9TalAter/annyang
💬 Speech recognition for your site