sindresorhus / awesome-whisper

🔊 Awesome list for Whisper — an open-source AI-powered speech recognition system developed by OpenAI



Repository Overview (README excerpt)


Whisper is an open-source AI-powered speech recognition system developed by OpenAI.

Contents

• Official
• Model variants
• Apps
• Web apps
• CLI tools
• Playgrounds
• Packages
• Articles
• Videos
• Community
• Third-party APIs
• Related lists

Official

• Introduction
• Source code
• White paper

Model variants

• Whisper.cpp - Port of Whisper in C++.
  • Bindings for many languages
• WhisperX - Adds fast automatic speech recognition with word-level timestamps and speaker diarization.
• faster-whisper - Faster reimplementation of Whisper using CTranslate2.
• Whisper JAX - JAX implementation of Whisper for up to 70x speed-up on TPU.
• whisper-timestamped - Adds word-level timestamps and confidence scores.
• whisper-openvino - Whisper running on OpenVINO.
• whisper.tflite - Whisper running on TensorFlow Lite.
• Whisper variants - Various Whisper variants on Hugging Face.
• Whisper-AT - Whisper that can recognize non-speech audio events in addition to speech.

Apps

• Aiko - Audio transcription iOS and macOS app.
• MacWhisper - Audio transcription macOS app. (Freemium)
• Whisper Memos - Audio transcription iOS app. (Freemium)
• FourYou - Audio journal iOS app.
• Jojo Transcribe - Audio transcription macOS app.
• Buzz - Audio transcription and translation macOS app.
• WhisperScript - Audio transcription macOS app. (Freemium · Electron)
• Audio Podium - Audio/video management macOS app.
• superwhisper - Global audio transcription macOS menu bar app.
• TypeWhisper - Local speech-to-text transcription for macOS and Windows with system-wide dictation.
• Speech Note - Audio transcription Linux app.
• FridayGPT - Dictation macOS app powered by OpenAI API.
• EasyWhisper - Windows and macOS app for audio transcription and speaker diarization. (Freemium)
• Audio Note - Real-time audio transcription on macOS and Windows. (Freemium · Electron)
• Whisper - Android app for transcription and translation. (FOSS)
• VoiceInk - Dictation and transcription macOS app. (FOSS)
• Ito AI - AI voice dictation for Mac. (FOSS)
• OpenSuperWhisper - Dictation app for macOS. (FOSS)
• Screenpipe - 24/7 local screen and audio recording with AI search. (FOSS)

Web apps

Hosted

• bigWav - Audio transcription and annotation tool.
• Free Podcast Transcription - Runs locally in your browser.
• Gladia - Transcription with real-time processing.
• Whisper-Web - Local transcription using WebGPU, with optimised fine-tuned models for several languages. (FOSS)

Self-hosted

• Subs AI - Subtitle generation.
• WaaS - GUI and API for Whisper.
• writeout.ai - Laravel app to transcribe and translate audio files.
• Meeper - Transcriptions, summaries, and more for meetings and any browser tab. (Chrome app)

CLI tools

• yt-whisper - YouTube subtitle generation.
• phonix - Generate captions for videos.
• whisper-standalone-win - Standalone Windows executable for Whisper and Faster Whisper.
• whisper-ctranslate2 - Whisper command-line tool based on CTranslate2, compatible with the original.
• insanely-fast-whisper-cli - Achieve transcription speeds near 30x real-time with several optimizations.
• whisper-diarization - Automatic speech recognition with speaker diarization.
• hns - On-device speech-to-text CLI using faster-whisper with automatic clipboard copy.

Playgrounds

• Hugging Face - Whisper demo running on Hugging Face. (Source)
• Monster API - Whisper demo running on Monster API. (Source)
• Web Whisper - Whisper demo by Pluja. (Source)
• YouTube Video Transcription - Running on Colab.

Packages

JavaScript

• use-whisper - React hook.

Articles

• Whispers of A.I.'s Modular Future - The future of machine learning lies in adaptable and accessible open-source speech-transcription programs.
• How to Run Whisper Speech Recognition Model - Explains how to install and run the model, and provides a performance analysis comparing Whisper to other models.
• Create your own speech to text app using Flask - Demonstrates Whisper's speech-to-text model, with a demo on running it in a Gradient Notebook and a guide for setting up a Flask app with Gradient Deployments.
• Convert Podcasts to Text - Tutorial on using the Whisper API with Python for speech-to-text transcription, showcasing faster transcription on a GPU.

Videos

• Open AI's Whisper is Amazing! - Introduction to Whisper.
• How to do Free Speech-to-Text Transcription Better Than Google Premium API - Tutorial.
• Multilingual AI Speech Recognition Live App - Tutorial.

Community

• Discussions
• Discord

Third-party APIs

APIs that use Whisper.

• Whisper+ - Extension of the Whisper model that adds features such as speaker identification, custom vocabulary, summarization, and chapter generation.
• Replicate - Use Whisper running on Replicate.

Related lists

• awesome-chatgpt - ChatGPT resources.