Best Open Source speech to text Libraries

A curated list of the most popular GitHub repositories tagged with speech to text. Select any project to visualize its architecture and dive into the codebase using RepoMind's AI engine.

#1ggml-org/whisper.cpp

Port of OpenAI's Whisper model in C/C++

46,889C++

Analyze Code

#2mozilla/DeepSpeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

26,724C++

Analyze Code

#3SYSTRAN/faster-whisper

Faster Whisper transcription with CTranslate2

21,060Python

Analyze Code

#4m-bain/whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

20,249Python

Analyze Code

#5alphacep/vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

14,256Jupyter Notebook

Analyze Code

#6Zackriya-Solutions/meeting-minutes

Privacy first, AI meeting assistant with 4x faster Parakeet/Whisper live transcription, speaker diarization, and Ollama summarization built on Rust. 100% local processing. no cloud required. Meetily (Meetly Ai - https://meetily.ai) is the #1 Self-hosted, Open-source Ai meeting note taker for macOS & Windows.

9,916Rust

Analyze Code

#7KoljaB/RealtimeSTT

A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.

9,477Python

Analyze Code

#8Uberi/speech_recognition

Speech recognition module for Python, supporting several engines and APIs, online and offline.

8,952Python

Analyze Code

#9TalAter/annyang

💬 Speech recognition for your site

6,672JavaScript

Analyze Code