Handy
Open-source cross-platform dictation
The most frequently repeated recommendation. The README positions it as a free, offline speech-to-text app for macOS, Windows, and Linux with auto-paste, configurable shortcuts, and Whisper and Parakeet models. Code search also showed post-processing providers, custom words, history, translation settings, Raycast integration, and CLI flags.
22,814 stars · MIT License · latest v0.8.3 (2026-04-28)
Evidence and source comments
- README: offline operation, auto-paste, global shortcut, Whisper small/medium/turbo/large, Parakeet V3, custom Whisper models, VAD, Raycast, CLI.
- Code: post-process providers include OpenAI, Groq, and custom providers; custom words, history entries, and translation settings are present.
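To make the "custom words" idea concrete, here is a minimal sketch of one way a post-processing step could snap near-miss tokens onto a user vocabulary. It is not Handy's implementation: the function name `apply_custom_words`, the `cutoff` parameter, and the fuzzy-matching approach (Python's standard `difflib`) are all assumptions chosen for illustration.

```python
from difflib import get_close_matches


def apply_custom_words(text: str, custom_words: list[str], cutoff: float = 0.8) -> str:
    """Replace tokens that closely resemble a custom word with that word.

    Hypothetical helper: illustrates fuzzy vocabulary correction only.
    """
    # Map lowercase forms back to the user's preferred spelling and casing.
    lowered = {word.lower(): word for word in custom_words}
    out = []
    for token in text.split():
        core = token.strip(".,!?;:")  # keep surrounding punctuation intact
        match = get_close_matches(core.lower(), list(lowered), n=1, cutoff=cutoff)
        if core and match:
            token = token.replace(core, lowered[match[0]], 1)
        out.append(token)
    return " ".join(out)


print(apply_custom_words("I use parakeat and wisper daily.", ["Parakeet", "Whisper"]))
```

A higher `cutoff` makes the correction more conservative, which matters because ordinary words that happen to resemble a custom word would otherwise be rewritten.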
VoiceInk
Native macOS dictation
Native macOS voice-to-text focused on privacy and local AI. It is strong on local transcription, smart modes, personal dictionary, contextual formatting, and screen/app awareness, but is macOS-only.
5,141 stars · Other · latest v1.79 (2026-05-23)
Evidence and source comments
- README: 100 percent offline, local AI models, Power Mode app/URL detection, screen-content context, configurable shortcuts, personal dictionary, smart modes, and AI assistant.
- Technology list includes whisper.cpp and FluidAudio/Parakeet.
Muesli
Local-first macOS dictation and meetings
A rich local-first macOS option that goes beyond dictation into meeting capture, diarization, note generation, calendar context, exports, and model management.
537 stars · MIT License · latest v0.6.9 (2026-05-27)
Evidence and source comments
- README: on-device speech-to-text, mic plus system audio meeting recording, VAD chunking, speaker diarization, meeting detection, Google Calendar integration, PDF/Markdown export, templates, filler removal, personal dictionary, configurable hotkeys, CLI.
- Models/providers: Parakeet TDT, Cohere Transcribe, WhisperKit, Qwen3; AI notes via BYOK OpenAI/OpenRouter/ChatGPT OAuth/local Ollama.
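As a rough illustration of what "VAD chunking" means, the sketch below splits audio into speech chunks using a simple energy threshold. This is a generic textbook technique, not Muesli's actual voice-activity detector, which the README summary does not describe. The function name `vad_chunks` and all parameter values are assumptions.

```python
def vad_chunks(samples, frame_len=160, threshold=0.01, max_silence_frames=3):
    """Return (start, end) sample-index pairs for runs of speech-like frames.

    A frame counts as speech when its mean squared amplitude exceeds
    `threshold`. A chunk is closed once `max_silence_frames` consecutive
    quiet frames follow it. Hypothetical helper for illustration only.
    """
    chunks = []
    start = None          # start index of the chunk being built, if any
    last_voiced_end = 0   # end index of the most recent speech frame
    quiet = 0             # consecutive quiet frames seen inside a chunk
    for i in range(0, len(samples), frame_len):
        frame = samples[i:i + frame_len]
        energy = sum(s * s for s in frame) / len(frame)
        if energy > threshold:
            if start is None:
                start = i
            last_voiced_end = i + len(frame)
            quiet = 0
        elif start is not None:
            quiet += 1
            if quiet >= max_silence_frames:
                chunks.append((start, last_voiced_end))
                start = None
                quiet = 0
    if start is not None:
        chunks.append((start, last_voiced_end))
    return chunks


# Synthetic signal: silence, a burst of "speech", a long pause, another burst.
signal = [0.0] * 320 + [0.5] * 480 + [0.0] * 800 + [0.5] * 160
print(vad_chunks(signal))
```

Each returned chunk could then be sent to the transcription model on its own, which is the usual motivation for chunking long meeting recordings.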
OpenWhispr
Cross-platform dictation and AI notes
Cross-platform open-source app aimed at voice-to-text, meetings, AI notes, semantic search, and local/cloud model choice. One of the more feature-complete options on paper.
3,472 stars · MIT License · latest v1.7.2 (2026-05-20)
Evidence and source comments
- README/site: macOS, Windows, Linux; local or cloud; Whisper and Parakeet; speaker diarization; meeting transcription; Google Calendar; notes, semantic search, API/MCP.
- Public releases exist for multiple platforms.
thinkur
macOS dictation and meeting transcripts
macOS-only, local-first dictation and meeting recorder. The source comment mentioned local models, but the current README/site primarily describe Apple's Speech framework, meeting transcripts, speaker labels, search, MCP, and per-app styles.
19 stars · MIT License
Evidence and source comments
- README/site: free open-source macOS voice-to-text, dictation plus meeting recording, 100 percent local, Apple Speech framework, speaker labels, searchable transcripts, MCP, smart post-processing, self-correction, per-app styles.
- Constraint: macOS 15+ and Apple Silicon.
Freestyle Voice
Source-available local-first dictation
Local-first dictation for macOS, Windows, and Linux. README emphasizes cloud-provider choice, cleaning, dictionary, contextual correction, and cross-platform releases; code search showed local Whisper support and local LLM config.
126 stars · MIT License · latest 0.0.10 (2026-05-31)
Evidence and source comments
- README: hotkey, paste at cursor, providers OpenAI/Groq/Anthropic/Google/Deepgram/ElevenLabs, transcription cleaning, dictionary, contextual correction, macOS/Windows/Linux releases.
- Code search: whisper.cpp download/use, history, dictionary, context-aware dictation, local LLM config.
Amical
Local-first dictation and note-taking
Open-source AI dictation and note-taking with context-aware dictation, active-app formatting, hotkeys, local models, and a native helper. Meeting transcription and MCP are marked as planned in its own feature table.
1,292 stars · MIT License · latest v1.7.1 (2026-05-26)
Evidence and source comments
- README: local-first, Whisper/Ollama, context-aware dictation, active app formatting, extensible hotkeys/voice macros/custom workflows, local/offline models, floating widget.
- README feature table marks MCP and meeting transcription as planned/partial rather than done. Code search showed paste helper, shortcuts, audio mute/restore, and selected-text context.
MacParakeet
Apple Silicon Parakeet dictation and transcription
Mac app centered on Parakeet running locally via FluidAudio/ANE. It is unusually broad for transcription input: system dictation, file/URL/YouTube transcription, meeting recording, calendar reminders, text cleanup, replacements, snippets, summaries, chat, and CLI.
285 stars · Other · latest v0.6.16 (2026-05-31)
Evidence and source comments
- README: Parakeet TDT on ANE via FluidAudio, system-wide dictation, file/URL/YouTube transcription, meeting recording, calendar reminders/autostart, text cleanup, replacements/snippets, summaries/chat/formatter/transforms.
- Also supports optional WhisperKit; Apple Silicon and macOS 14.2+.
FluidVoice
macOS dictation with command mode
Open-source macOS voice-to-text with strong local model support and optional AI enhancement. It has live preview, smart typing into any app, command mode, write mode, and history/stats.
2,269 stars · GNU General Public License v3.0 · latest v1.5.14 (2026-05-29)
Evidence and source comments
- README: Parakeet Flash/v3/v2, Cohere Transcribe, Apple Speech, Whisper; live preview; low latency; OpenAI/Groq/custom providers for enhancement; global hotkey; smart typing into any app; menu bar; command mode; write mode.
- macOS 15+, Apple Silicon preferred; Intel supported with Whisper models in later versions.
OpenLess
macOS and Windows voice input
Open-source voice input for macOS and Windows with hotkey dictation, cursor insertion, AI polishing modes, translation, selection QA, history, vocabulary, and both local and cloud ASR paths.
1,931 stars · MIT License · latest v1.3.4-tauri (2026-05-22)
Evidence and source comments
- README: macOS and Windows; hotkey; AI-polished text; insert at cursor; clipboard fallback; cloud ASR via Volcengine/OpenAI Whisper-compatible/Apple Speech; local ASR via Qwen3-ASR and Windows Foundry Local Whisper.
- README: polish providers Ark/DeepSeek/OpenAI/Doubao/Anthropic-compatible/custom; raw/light/structured/formal modes; translation hotkey; selection-ask QA panel; history, vocab, settings, hotwords.
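The "insert at cursor; clipboard fallback" behaviour can be sketched as a small delivery function. This is only an illustration of the fallback pattern, not OpenLess code: `deliver_text` and the two injected callables are hypothetical names, and the real platform-specific insertion logic is left out.

```python
from typing import Callable


def deliver_text(
    text: str,
    insert_at_cursor: Callable[[str], bool],
    copy_to_clipboard: Callable[[str], None],
) -> str:
    """Try direct insertion first; fall back to the clipboard if it fails.

    `insert_at_cursor` is assumed to return True on success. Both callables
    are placeholders for platform-specific code.
    """
    try:
        if insert_at_cursor(text):
            return "inserted"
    except Exception:
        # Treat any insertion error as a reason to fall back.
        pass
    copy_to_clipboard(text)
    return "clipboard"


# Usage with stand-in callables: insertion fails, so the text is copied.
clipboard: list[str] = []
result = deliver_text("hello world", lambda t: False, clipboard.append)
print(result, clipboard)
```

Passing the two operations in as callables keeps the fallback logic testable without touching the real clipboard or input system.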
Typeflux
macOS local-first dictation and Ask Anything
macOS voice input with hold-Fn dictation and double-press Ask Anything. It combines local STT options with many cloud providers, custom personas, streaming preview, history, translation, and export.
271 stars · GNU Affero General Public License v3.0
Evidence and source comments
- README: hold Fn to dictate, double press for Ask Anything, text into any macOS app, local-first/privacy-first, custom personas, multiple speech backends.
- Local models include SenseVoice, FunASR, WhisperKit, Qwen3-ASR; cloud options include Alibaba, Doubao, Google, OpenAI, Groq, and more.
type4me
macOS voice input with many providers
macOS voice input with built-in local recognition, cloud providers, streaming recognition, prompt templates, hotwords/snippets, local LLM options, and history export.
1,226 stars · MIT License · latest v1.9.3 (2026-04-26)
Evidence and source comments
- README: built-in local recognition plus cloud providers; streaming recognition; text polish, prompt optimization, translation, custom templates; Ollama local LLM; hotwords/snippet replacements; history export.
- Local version mentions SenseVoice and Qwen3-ASR; architecture lists 13 LLM providers.
macOS voice input: hold a hotkey, transcribe locally, refine with LLM, and auto-paste. License metadata is 'Other', so treat it as public-source/source-available rather than cleanly OSI-open.
266 stars · Other · latest v0.6.1 (2026-05-28)
Evidence and source comments
- GitHub metadata description: hold a hotkey to record, release to transcribe locally via STT, refine with LLM, and auto-paste into the active text field.
- README/repo includes app flow around local STT, LLM refinement, history, custom words, and macOS integration.
Speed of Sound
Linux desktop voice typing
A Linux-first voice typing app with offline local models, direct typing into apps, tray/global shortcut support, optional LLM polishing, and custom context/vocabulary.
144 stars · MIT License · latest v0.14.0 (2026-05-09)
Evidence and source comments
- README: Linux desktop voice typing, offline transcription via Whisper/Parakeet/Canary and other models, global shortcut/system tray/button, X11/Wayland portals, multi-language.
- Optional LLM polishing via Anthropic/Google/OpenAI or self-hosted vLLM/Ollama/llama.cpp; supports custom context/vocab.
MoFA-IME
macOS local voice input method
Resident macOS voice input: hold Fn, transcribe locally, optionally polish with a local LLM, and write into the current input field. It also includes model management and history/clipboard utilities.
4 stars · Apache License 2.0 · latest 0.3.0 (2026-02-23)
Evidence and source comments
- README: hold Fn to transcribe locally, optional local LLM polish, writes into the input field, global hotkey through CGEventTap, menu bar, floating UI, history window.
- Local chain: Whisper ASR and Qwen GGUF through llama.cpp FFI; model management GUI and clipboard manager.
VOICE2TYPE
Windows tray voice input
Windows tray voice input with hotkey/toggle modes, clipboard paste or keyboard injection, Groq/SiliconFlow/local Whisper engines, and experimental real-time subtitles.
38 stars · MIT License · latest v0.0.51-the-last-pure-vision (2026-05-26)
Evidence and source comments
- README: global hotkey, hold-to-talk/toggle, clipboard paste/keyboard injection, SiliconFlow/Groq/local Whisper, filtering punctuation/emoji, re-paste last, autostart/update/logs.
- Experimental real-time subtitles from speaker/mic audio and bilingual subtitles through LLM providers.
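The difference between hold-to-talk and toggle modes can be shown with a small state machine. This is a sketch under assumptions, not VOICE2TYPE's code: the `Mode` enum, the `HotkeyRecorder` class, and the auto-repeat handling are illustrative names and choices.

```python
from enum import Enum


class Mode(Enum):
    HOLD = "hold"      # record only while the key is held down
    TOGGLE = "toggle"  # first press starts recording, next press stops it


class HotkeyRecorder:
    """Tracks whether recording should be active for a given hotkey mode."""

    def __init__(self, mode: Mode) -> None:
        self.mode = mode
        self.recording = False
        self._pressed = False  # used to ignore OS key auto-repeat events

    def key_down(self) -> None:
        if self._pressed:
            return  # repeated key-down while the key is still held
        self._pressed = True
        if self.mode is Mode.HOLD:
            self.recording = True
        else:
            self.recording = not self.recording

    def key_up(self) -> None:
        self._pressed = False
        if self.mode is Mode.HOLD:
            self.recording = False


recorder = HotkeyRecorder(Mode.TOGGLE)
recorder.key_down()
recorder.key_up()
print(recorder.recording)  # still recording until the next press
```

Ignoring repeated key-down events matters mainly in toggle mode, where a held key would otherwise flip the recording state several times.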
Whisp
macOS local Kyutai dictation
Small macOS push-to-talk app that runs locally on Apple Silicon using Kyutai STT via MLX. It is privacy-focused but intentionally minimal compared with the feature-heavy alternatives.
0 stars
Evidence and source comments
- README: hold Fn to dictate; runs locally on Apple Silicon using Kyutai STT through MLX; menu bar app; no cloud; requires macOS 13+ and Apple Silicon; roughly 3 GB model.
whispy
Serverless Groq dictation
A simple serverless dictation app for macOS and Windows using Groq Whisper Large v3, optional OpenAI cleanup, hotkeys, auto-paste, and tray mode. The author explicitly said it does not support local models.
2 stars
Evidence and source comments
- README: Groq Whisper Large v3, optional OpenAI cleanup, hotkeys, auto-paste, tray, Windows/macOS.
- Thread reply from author: it does not support local models.
Invoke
Voice command layer
More of a local-first voice command/action layer than a Wispr Flow clone. It maps speech to structured actions across apps using local STT/LLM plus Composio integrations.
3 stars
Evidence and source comments
- README: desktop/Windows/Android voice command layer, natural speech to structured actions, local Whisper tiny STT, Qwen 3 0.6B through Ollama, Composio integrations for Gmail/GitHub/Slack/Calendar/Notion/Todoist/Docs/search.
- Also includes writing cleanup, snippets, dictionary, style presets, privacy mode, Android voice bubble, Windows Tauri desktop.
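To illustrate what "natural speech to structured actions" could look like as data, the sketch below maps an utterance to an action dictionary. Invoke reportedly uses a local LLM for this step; the regular-expression rules here are a deliberately simple stand-in, and the action names (`email.send`, `todo.add`), the `RULES` table, and `to_action` are all hypothetical rather than part of Invoke or Composio.

```python
import re
from typing import Optional

# Hypothetical rule table: (pattern, action name). A real system would
# likely use an LLM instead of hand-written patterns.
RULES = [
    (
        re.compile(
            r"^(?:send|write) (?:an )?email to (?P<to>\w+) "
            r"(?:saying|about) (?P<body>.+)$",
            re.IGNORECASE,
        ),
        "email.send",
    ),
    (
        re.compile(r"^add (?P<title>.+) to my (?:todo|to-do) list$", re.IGNORECASE),
        "todo.add",
    ),
]


def to_action(utterance: str) -> Optional[dict]:
    """Return a structured action for the utterance, or None if no rule fits."""
    cleaned = utterance.strip()
    for pattern, name in RULES:
        match = pattern.match(cleaned)
        if match:
            return {"action": name, "args": match.groupdict()}
    return None


print(to_action("Add buy milk to my todo list"))
```

The useful part is the output shape: a named action plus arguments is something an integration layer can dispatch, regardless of whether rules or a model produced it.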
TMSpeech
Windows real-time subtitles
Windows real-time Chinese subtitle and meeting transcription tool. It is adjacent to dictation rather than a cursor-insertion replacement, but relevant for local meeting speech capture.
1,332 stars · MIT License · latest v0.4.3 (2025-12-15)
Evidence and source comments
- README: Windows realtime Chinese subtitles from system audio via WASAPI loopback, lyric-style subtitles, meeting live transcription, meeting minutes saved to file, sherpa-onnx streaming model.
BreezeApp
Mobile offline AI app
Mobile offline AI platform for Android/iOS with speech-to-text among other modalities. Relevant to voice transcription, but not a desktop system-wide dictation app.
221 stars · Apache License 2.0 · latest v2.0.0 (2025-08-05)
Evidence and source comments
- README: Android/iOS, offline speech-to-text, text-to-speech, chat, image QA, privacy/offline mobile AI platform.
Speech-to-clipboard
Linux speech-to-clipboard script
Small Linux script that records audio, sends it to ElevenLabs STT, cleans incidental text, and writes the result to the clipboard. Useful but much narrower than a full app.
0 stars · MIT License
Evidence and source comments
- README: Linux script, speech to clipboard, ElevenLabs API, can be bound to a hotkey, includes lightweight cleanup; local model support would require modification.
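The "lightweight cleanup" step might look something like the sketch below. It assumes that "incidental text" means bracketed or parenthesized non-speech annotations such as `(laughs)` or `[music]`; the script's actual rules are not described here, and `clean_transcript` is a hypothetical name. Recording, the ElevenLabs call, and the clipboard write are intentionally left out.

```python
import re

# Matches a parenthesized or bracketed annotation plus any whitespace before it.
_ANNOTATION = re.compile(r"\s*[\(\[][^\)\]]*[\)\]]")


def clean_transcript(text: str) -> str:
    """Drop bracketed annotations and collapse leftover whitespace."""
    without_annotations = _ANNOTATION.sub("", text)
    return re.sub(r"\s+", " ", without_annotations).strip()


print(clean_transcript("(laughs) Hello there [music]  world."))
```

Note that this also removes parenthetical remarks the speaker actually dictated, so a stricter pattern would be needed if those should survive.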
Shuo
Closed app, public website repo
The tweet listed Shuo, but also said the app is work-in-progress and will not be open source in the near future. The cloned repo is only the website, so product claims are less inspectable.
0 stars · Apache License 2.0
Evidence and source comments
- Website README: professional local voice input method for macOS/Windows, local voice recognition, custom AI providers, auto-structuring, colloquial filtering, privacy.
- Thread caveat: app itself is not expected to be open source soon.