SuperWhisper
AI-powered dictation with LLM reformatting

SuperWhisper — Official Website
Quick Take: SuperWhisper
SuperWhisper is the best local voice-to-text app for macOS. Whisper models deliver excellent accuracy, everything runs on-device for privacy, and the intelligent modes transform raw dictation into structured text. The requirement for Apple Silicon is limiting but understandable—the Neural Engine is what makes real-time local transcription possible. The subscription cost ($8/month) is the main friction point for casual users. For anyone who regularly dictates text, writes long-form content, or needs to transcribe recordings with privacy guarantees, SuperWhisper is a significant productivity tool.
Best For
- •Writers and Professionals Who Dictate Regularly
- •Privacy-Conscious Users Who Want Local Speech Recognition
- •Anyone with RSI or Physical Limitations That Make Typing Painful
What is SuperWhisper?
SuperWhisper is a voice-to-text app for macOS that runs OpenAI's Whisper speech recognition models locally on your Mac. You press a hotkey, speak, and your words appear as text in whatever app has focus. The transcription runs on your machine using the Neural Engine and GPU—your audio never leaves your computer. What makes SuperWhisper different from macOS's built-in dictation or cloud-based speech recognition is the combination of accuracy, privacy, and speed. Whisper models are among the most accurate speech recognition systems available—they handle accents, technical jargon, code-related terminology, and natural speech patterns better than Apple's built-in dictation. Because processing happens locally, there's no internet latency, no data uploaded to servers, and no privacy concerns about your spoken content being processed in the cloud. SuperWhisper was created by Sascha (indie developer) and runs exclusively on Apple Silicon Macs (M1 and later). The Neural Engine in M-series chips is what makes local Whisper inference practical—transcription happens at near-real-time speed. On an M1 MacBook Pro, short utterances transcribe in under a second. Longer dictation (a full paragraph) takes 2-4 seconds depending on the model size. The app supports multiple Whisper model sizes. Smaller models (tiny, base) transcribe faster but with slightly lower accuracy. Larger models (small, medium, large-v3) are more accurate but take longer. You choose the trade-off that works for your hardware and accuracy needs. On M2 Pro/Max and M3 machines, even the large model runs at practical speeds. SuperWhisper also includes AI post-processing modes that can clean up your transcription—removing filler words, fixing punctuation, formatting as bullet points, or transforming your speech into structured text. These features use OpenAI's API (cloud-based, requires an API key) or local LLMs via Ollama. The core transcription always runs locally; the AI modes are optional and clearly separated. For anyone who types slower than they think, or whose hands get tired from long writing sessions, SuperWhisper turns speech into text with remarkable accuracy and zero privacy compromise.
Install with Homebrew
brew install --cask superwhisperDeep Dive: Local AI Speech Recognition on Apple Silicon
How Whisper models run on Mac hardware and what it means for privacy-first voice input.
History & Background
OpenAI released Whisper in September 2022 as an open-source speech recognition model. Trained on 680,000 hours of multilingual audio data from the internet, Whisper matched or exceeded commercial speech recognition services in accuracy across multiple languages. The model weights are publicly available, which enabled apps like SuperWhisper to run inference locally. Apple Silicon's Neural Engine (a dedicated machine learning accelerator) and Core ML (Apple's ML framework) made it practical to run Whisper at real-time speeds on consumer hardware.
How It Works
SuperWhisper converts Whisper's PyTorch model weights to Core ML format for optimized inference on Apple Silicon. The Neural Engine handles the transformer computation, while the GPU assists with audio preprocessing (mel spectrogram generation). The pipeline: microphone captures audio → audio is processed into mel spectrograms → the Whisper model (running on Neural Engine) generates text tokens → tokens are decoded into text → optional AI post-processing formats the text. The entire pipeline runs locally except for optional cloud-based post-processing.
Ecosystem & Integrations
Local AI on Mac is a rapidly growing category. Ollama runs LLMs locally. LM Studio provides a GUI for local language models. SuperWhisper runs Whisper for speech-to-text. Together, these apps represent a shift toward privacy-preserving AI that doesn't require cloud services. Apple's own investments in on-device AI (Apple Intelligence, on-device dictation) validate the approach. The ecosystem is early but growing fast.
Future Development
SuperWhisper's roadmap includes support for newer Whisper model variants (Whisper v4 when released), improved real-time streaming transcription (currently the transcription has a slight delay), deeper integration with macOS Shortcuts and Automator, and potentially Apple Intelligence integration. The broader trend of on-device AI will continue to make local speech recognition faster and more accurate.
Key Features
Local Whisper Transcription
SuperWhisper runs OpenAI's Whisper models entirely on your Mac using Core ML and the Neural Engine. No internet connection required for transcription. No audio sent to external servers. Supported models range from tiny (fastest, 39MB) to large-v3 (most accurate, 3.1GB). Model switching is available in the app—download the models you want and select based on your accuracy/speed preference.
Intelligent Modes
Beyond raw transcription, SuperWhisper offers modes that process your speech differently. 'Dictation' mode transcribes verbatim. 'Note' mode cleans up filler words and adds punctuation. 'Email' mode structures your speech into an email format. 'Code' mode recognizes programming terminology and formats appropriately. Custom modes let you define your own processing rules using system prompts.
Custom Vocabulary & Context
Add custom vocabulary terms—company names, technical jargon, project names, acronyms—that Whisper might not recognize out of the box. SuperWhisper uses these terms as context hints during transcription, improving accuracy for domain-specific language. If you work in a specialized field (medicine, law, engineering), custom vocabulary makes a measurable difference.
Global Hotkey Integration
Press a configurable hotkey from any application to start transcription. Speak your text, press the hotkey again (or release it in push-to-talk mode), and the transcribed text is inserted at the cursor position. Works with any app that accepts text input—email clients, messaging apps, code editors, note-taking tools, browsers. The hotkey works system-wide, making dictation accessible from anywhere.
File Transcription
SuperWhisper can transcribe audio and video files (MP3, WAV, M4A, MP4) locally using the same Whisper models. Drag an audio file onto the app, and it produces a text transcript. This is useful for meeting recordings, podcast transcription, and converting voice memos to text—all processed locally on your Mac.
API Key Support
For AI post-processing modes that go beyond basic transcription (email formatting, summarization, style transformation), SuperWhisper supports OpenAI API keys. You can also point it at local LLMs via Ollama or any OpenAI-compatible API. The core transcription never uses the API—it always runs locally. The API is only for optional text post-processing.
Who Should Use SuperWhisper?
1Writer with RSI
A technical writer develops repetitive strain injury from years of typing. SuperWhisper becomes their primary text input method. They dictate articles, documentation, and emails by speaking naturally. The 'Note' mode cleans up their speech into well-punctuated paragraphs. Custom vocabulary includes technical terms specific to their documentation domain. Writing output stays high while typing strain drops to near zero.
2Developer Writing Documentation
A developer needs to write documentation for an API but types slowly and dislikes writing prose. They switch to SuperWhisper, dictate their explanations while looking at the code, and let the app transcribe. The 'Code' mode handles technical terms correctly. They edit the transcribed text for clarity—faster than typing the whole thing from scratch.
3Meeting Note Taker
After a Zoom call, a project manager drags the meeting recording into SuperWhisper. The large-v3 model transcribes the entire 45-minute recording locally. They use an AI mode to extract action items and summarize key decisions. No meeting audio is uploaded to the cloud—important for discussions involving sensitive business information.
How to Install SuperWhisper on Mac
SuperWhisper requires an Apple Silicon Mac (M1 or later) and is available via Homebrew or direct download.
Install via Homebrew
Run `brew install --cask superwhisper`. The app installs to your Applications folder.
Grant Microphone Permission
On first launch, macOS will request microphone access. Grant it in System Settings > Privacy & Security > Microphone.
Download a Whisper Model
SuperWhisper will prompt you to download a Whisper model. Start with 'small' for a good balance of speed and accuracy. You can download additional models later. Model sizes range from 39MB (tiny) to 3.1GB (large-v3).
Set Global Hotkey
Configure your transcription hotkey in preferences. The default varies—set it to a key combination you don't use in other apps (e.g., Fn key, right Option, or a specific function key).
Pro Tips
- • Start with the 'small' model and upgrade to 'medium' or 'large-v3' if accuracy isn't sufficient for your use case.
- • Push-to-talk mode (hold hotkey while speaking, release to transcribe) works well for short utterances. Toggle mode (press to start, press to stop) is better for longer dictation.
- • Set up your custom vocabulary before your first real use—add company names, project names, and technical terms.
Configuration Tips
Choose the Right Model for Your Hardware
On M1/M2 base chips, the 'small' model offers the best speed/accuracy trade-off. On M2 Pro/Max, M3, and M4 chips, the 'medium' or 'large-v3' models run fast enough for practical use. Test each model with your typical speech (technical terms, accent, speaking speed) to find the sweet spot.
Build Your Custom Vocabulary
Add terms you use frequently that Whisper might not recognize: company names (bundl, GoodFolks), product names (OrbStack, Ghostty), technical terms (kubectl, nginx, PostgreSQL), and acronyms (CICD, PR, SSO). SuperWhisper uses these as transcription hints. The more specific your vocabulary, the better the accuracy.
Alternatives to SuperWhisper
Voice-to-text options on macOS range from built-in features to specialized apps.
macOS Dictation (Built-in)
Apple's built-in dictation is free and works system-wide. Recent versions use on-device processing for English on Apple Silicon, which is similar to SuperWhisper's local approach. The accuracy is good for general speech but weaker for technical jargon and accented English. SuperWhisper's Whisper models are more accurate overall, and custom vocabulary support handles specialized terminology that Apple's dictation misses.
Whisper Transcription
An open-source macOS app that also runs Whisper locally. It provides basic transcription without the intelligent modes, custom vocabulary, or polish of SuperWhisper. If you want a free, no-frills Whisper interface, it's an option. SuperWhisper adds the workflow features (modes, vocabulary, file transcription) that make daily use practical.
Otter.ai
A cloud-based transcription service that excels at meeting transcription with speaker identification. Otter sends all audio to its cloud for processing—a significant privacy difference from SuperWhisper's local approach. Otter is better for multi-speaker meeting transcription. SuperWhisper is better for personal dictation where privacy matters.
Pricing
SuperWhisper offers a free trial period. The subscription is approximately $8/month or $60/year. The subscription covers app updates and access to all Whisper models. No additional charges for local transcription—transcription uses your Mac's hardware. AI post-processing modes that use OpenAI's API require your own API key (billed directly by OpenAI based on usage).
Pros
- ✓Whisper models provide industry-leading transcription accuracy
- ✓Fully local transcription — audio never leaves your Mac
- ✓Works offline with no internet connection required for core features
- ✓Multiple model sizes let you trade speed for accuracy
- ✓Custom vocabulary improves accuracy for domain-specific terms
- ✓File transcription handles meeting recordings and voice memos
- ✓Intelligent modes clean up speech into structured text
Cons
- ✗Requires Apple Silicon Mac (M1 or later) — no Intel Mac support
- ✗Large Whisper models (medium, large-v3) need several GB of storage
- ✗AI post-processing modes require an OpenAI API key (cloud-based)
- ✗Subscription model ($8/month) for a dictation app may feel expensive
- ✗Transcription of longer audio has a noticeable delay (2-4 seconds)
- ✗Accuracy still depends on microphone quality and background noise
Community & Support
SuperWhisper has a growing user community, primarily on Twitter/X where the developer shares updates and responds to feedback. The app's website provides documentation and FAQs. Users discuss SuperWhisper in productivity communities on Reddit (r/macapps, r/productivityapps) and in the broader Whisper model community. The developer is responsive to feature requests and bug reports. As local AI apps grow in popularity on Apple Silicon, SuperWhisper has become one of the most-recommended voice-to-text solutions for privacy-conscious Mac users.
Video Tutorials
Getting Started with SuperWhisper
More Tutorials
Getting Started with superwhisper
Superwhisper • 41.4K views
Superwhisper Review - 2025 | Smartest Voice-to-Text AI for Mac - Best Free AI Dictation Tool?
Daniel | Tech & Data • 36.6K views
Best SuperWhisper Speech To Text App For iPhone? - Spokenly Tutorial
Rish - AI Business Automation • 1.0K views
Frequently Asked Questions about SuperWhisper
Our Verdict
SuperWhisper is the best local voice-to-text app for macOS. Whisper models deliver excellent accuracy, everything runs on-device for privacy, and the intelligent modes transform raw dictation into structured text. The requirement for Apple Silicon is limiting but understandable—the Neural Engine is what makes real-time local transcription possible. The subscription cost ($8/month) is the main friction point for casual users. For anyone who regularly dictates text, writes long-form content, or needs to transcribe recordings with privacy guarantees, SuperWhisper is a significant productivity tool.
About the Author
Productivity & Workflow Analyst
Related Technologies & Concepts
Related Topics
Sources & References
Fact-CheckedLast verified: Feb 23, 2026
Key Verified Facts
- SuperWhisper uses OpenAI Whisper models and the whisper.cpp framework for on-device speech-to-text processing.
- 1SuperWhisper
Accessed Feb 23, 2026
Research queries: SuperWhisper Mac voice to text 2026