back to home

altic-dev / FluidVoice

FluidVoice - Fastest macOS Offline Dictation app - Voice to Text fully Local. One ⭐ takes us a long way :))

1,443 stars
86 forks
41 issues
SwiftShell

AI Architecture Analysis

This repository is indexed by RepoMind. By analyzing altic-dev/FluidVoice in our AI interface, you can instantly generate complete architecture diagrams, visualize control flows, and perform automated security audits across the entire codebase.

Our Agentic Context Augmented Generation (Agentic CAG) engine loads full source files into context on-demand, avoiding the fragmentation of traditional RAG systems. Ask questions about the architecture, dependencies, or specific features to see it in action.

Source files are only loaded when you start an analysis to optimize performance.

Embed this Badge

Showcase RepoMind's analysis directly in your repository's README.

[![Analyzed by RepoMind](https://img.shields.io/badge/Analyzed%20by-RepoMind-4F46E5?style=for-the-badge)](https://repomind.in/repo/altic-dev/FluidVoice)
Preview:Analyzed by RepoMind

Repository Overview (README excerpt)

Crawler view

FluidVoice Fully open source voice-to-text dictation app for macOS with AI enhancement. **Get the latest release from here** > [!IMPORTANT] > This project is completely free and open source. If you find FluidVoice useful, please star the repository. It helps with visibility and motivates continued development. Your support means a lot. Star History Demo Command Mode - Take any action on your mac using FluidVoice https://github.com/user-attachments/assets/ffb47afd-1621-432a-bdca-baa4b8526301 Write Mode - Write/Rewrite text in ANY text box in ANY App on your mac https://github.com/user-attachments/assets/c57ef6d5-f0a1-4a3f-a121-637533442c24 Screenshots Command Mode Preview FluidVoice History New Features (v1.5) • **Overlay with Notch support** • **Command Mode** • **Write Mode** • **New History stats** • **Stats to monitor usage** Features • **Live Preview Mode**: Real-time transcription preview in overlay • **Multiple Speech Models**: Parakeet TDT v3 & v2, Apple Speech, and Whisper • **Real-time transcription** with extremely low latency • **AI enhancement** with OpenAI, Groq, and custom providers • **Global hotkey** for instant voice capture • **Smart typing** directly into any app • **Menu bar integration** for quick access • **Auto-updates** with seamless restart Supported Models Parakeet TDT v3 (Default) Optimized for Apple Silicon. Supports 25 languages with auto-detection: **Bulgarian, Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, German, Greek, Hungarian, Italian, Latvian, Lithuanian, Maltese, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish, Russian, Ukrainian.** Parakeet TDT v2 English-only model optimized for higher accuracy and consistency on Apple Silicon. Apple Speech Uses the native macOS Speech Recognition engine and Speech Analyzer API (macOS 26+). Whisper Universal support (runs on Intel & Apple Silicon). Supports 99 languages. Quick Start • Download the latest release • Move to Applications folder • Grant microphone and accessibility permissions when prompted • Set your preferred hotkey in settings • Optionally add an AI provider API key for enhanced transcription, keys are stored securely in your macOS Keychain. Make sure select "Always allow" for permissions Requirements • macOS 14.0 (Sonoma) or later • Apple Silicon Mac (M1, M2, M3, M4) • Intel Macs are supported from 1.5.1 builds using Whisper models! • Microphone access • Accessibility permissions for typing Join our small community to help us grow and give feedback :) ( Or just hang?!) https://discord.gg/VUPHaKSvYV Building from Source Build and run in Xcode. All dependencies are managed via Swift Package Manager. Contributing Contributions are welcome! Please create an issue first to discuss any major changes or feature requests before submitting a pull request. Setting Up Your Development Environment • **Clone the repository:** • **Open in Xcode:** • **Run from Xcode (one-time signing setup):** • Target: → • Enable • Pick your (Personal Team is fine) • This is stored in (gitignored), so it won’t affect your PR • **Build and run** - All dependencies are managed via Swift Package Manager • **Build only (no signing):** • **(Optional) Install pre-commit hook** to prevent accidental team ID commits: Pull Request Guidelines • **Keep changes focused and atomic** - one feature or fix per PR • **Update documentation** if adding new features • **Test thoroughly** on your machine before submitting • **Never commit personal team IDs or API keys** to • **Check git diff** before committing to ensure no personal settings leaked in Connect Follow development updates on X: @ALTIC_DEV Run integration dictation test If you run into a test bundle load error related to code signing/Team ID, run without overriding code signing flags (the command above), or explicitly: CI uses unsigned builds: Privacy & Analytics FluidVoice is designed to be local-first, but it includes optional anonymous analytics, solely to keep track of FV usage and future feature development. What this helps with • Feature usage patterns (for example dictation, command mode, and write mode) • Product reliability and performance tuning • Anonymous release health signals to prioritize fixes and improvements What is collected • App version, build, and macOS version • Low-cardinality feature/config flags (for example app mode and major settings) • Approximate usage ranges (not exact values) • High-level success/error outcomes What is not collected • Your voice, Raw audio or transcribed text • Selected text, prompts, or AI-generated responses • Terminal commands or outputs • Window titles, file names/paths, clipboard content, or typed content • or ANYTHING personal or private information. How to disable Analytics are enabled by default. You can disable or re-enable them at any time from: Why this exists Anonymous telemetry helps us understand what breaks, where performance can be improved, and which features matter most without collecting personal content. This helps us continue building what users want and if we should even continue developing features for you. License History • Versions before **2026-02-23**: Apache License 2.0 • Versions on and after **2026-02-23**: GNU General Public License v3.0 (GPLv3) License From 2026-02-23 onward, this project is licensed under the GNU General Public License, Version 3.0 (GPLv3). Versions published before this date were licensed under Apache License 2.0. ---