Overview
WhisperKit is Apple’s optimized implementation of OpenAI’s Whisper model, designed to run locally on Apple Silicon Macs. It provides completely private, offline transcription without requiring an internet connection or API key. Key Benefits:- 🔒 100% Private - Audio never leaves your Mac
- 🌐 Offline - Works without internet connection
- 💰 Free - No API keys, no usage costs
- 🚀 Optimized - Built for Apple Silicon
Privacy & Security
WhisperKit processes everything locally on your Mac:- Audio is captured and transcribed on-device
- No data sent to cloud services
- No internet connection required
- No API keys or accounts needed
- Perfect for sensitive or confidential content
No API Key Required
Unlike cloud providers, WhisperKit requires no setup:- Select WhisperKit in Stenox Settings
- Download your preferred model
- Start dictating immediately
Available Models
WhisperKit offers several model sizes with different trade-offs:| Model | Size | Speed | Accuracy | Recommended For |
|---|---|---|---|---|
| base.en | 142 MB | Fast | Good | English-only, quick setup |
| small.en | 483 MB | Medium | Better | English-only, better accuracy |
| base | 142 MB | Fast | Good | Multilingual (100+ languages) |
| small | 483 MB | Medium | Better | Multilingual, better accuracy |
Recommended for beginners:
base.en - Fastest download and processing with good accuracy for English.Model Selection Guide
base.en - Fast and lightweight
base.en - Fast and lightweight
Best for:
- Quick setup (smallest download)
- Fast processing (2-3 seconds)
- English-only dictation
- Limited disk space
small.en - Better accuracy
small.en - Better accuracy
Best for:
- Higher accuracy needs
- Professional writing
- Technical terminology
- English-only dictation
base / small (Multilingual)
base / small (Multilingual)
Best for:
- Non-English languages
- Multilingual dictation
- Code-switching between languages
Setup Instructions
1
Open Stenox Settings
Click the Stenox icon in your menu bar and select Settings.
2
Navigate to Models tab
Go to the Models tab in the Settings window.
3
Select WhisperKit
Under Transcription Provider, select WhisperKit from the dropdown.
4
Choose a model
Select your preferred model:
- base.en (recommended for beginners)
- small.en (better accuracy)
- base or small (multilingual)
5
Download the model
Click Download and wait for the model to download.Download sizes:
- base.en / base: ~142 MB
- small.en / small: ~483 MB
~/stenox-models/whisper/ by default.6
Start dictating
Once download completes, you can start using WhisperKit immediately!
Performance
WhisperKit performance depends on your Mac’s hardware:Apple Silicon Macs
Excellent performance:- base.en: 2-3 seconds per 10 seconds of audio
- small.en: 4-6 seconds per 10 seconds of audio
- Optimized with CoreML and ANE (Apple Neural Engine)
Intel Macs
Slower performance:- 2-3x slower than Apple Silicon
- CPU-only processing
- Consider cloud providers for faster results
Storage Requirements
Models are stored locally on your Mac:- Location:
~/stenox-models/whisper/ - base.en / base: ~142 MB
- small.en / small: ~483 MB
- medium: ~1.5 GB (if you download it manually)
Language Support
WhisperKit supports 100+ languages:- English Models (.en)
- Multilingual Models
- Optimized for English only
- Slightly faster and more accurate for English
- Use
base.enorsmall.en
When to Use WhisperKit
Privacy is critical
Healthcare, legal, financial, or any sensitive content.
Working offline
Airplanes, remote locations, or unstable internet.
No API costs
No per-minute charges. Processing happens on your Mac.
Learning or testing
Try Stenox with zero setup or commitment.
When to Use Cloud Instead
Consider cloud providers if you need:- Faster processing - Cloud is 5-10x faster (< 1 second vs 3-5 seconds)
- Higher accuracy - DeepGram Nova-3 and AssemblyAI often outperform Whisper
- Intel Mac - Cloud providers work equally well on all Macs
You can use both WhisperKit and cloud providers! Create different Profiles for different scenarios.
Troubleshooting
Model download fails
Model download fails
- Check your internet connection
- Ensure you have enough disk space (~500 MB free)
- Try downloading again
- Check
~/stenox-models/whisper/for partial downloads and delete them
Transcription is very slow (> 10 seconds)
Transcription is very slow (> 10 seconds)
- You may be on an Intel Mac (slower performance expected)
- Try
base.eninstead ofsmall.enfor faster processing - Close other intensive applications
- Consider using a cloud provider for speed
Wrong language being transcribed
Wrong language being transcribed
- For English, use
.enmodels (base.en or small.en) - For other languages, use multilingual models and set language in Profile settings
- Auto-detect may choose wrong language - specify explicitly
Poor accuracy for technical terms
Poor accuracy for technical terms
- Add terms to Vocabulary for custom replacements
- Try
small.enfor better accuracy - Consider cloud providers (DeepGram, AssemblyAI) for specialized vocabulary

