Overview
Parakeet is NVIDIA’s state-of-the-art speech recognition model, optimized for Apple Silicon via CoreML. It achieves significantly better accuracy than Whisper while running locally on your Mac’s Neural Engine — no internet required. Key Benefits:- Best Accuracy — 6% word error rate (vs Whisper’s 7.4%)
- 100% Private — Audio never leaves your Mac
- Offline — Works without internet connection
- Free — No API keys, no usage costs
- Fast — Optimized for Apple Neural Engine
Privacy & Security
Parakeet processes everything locally on your Mac:- Audio is captured and transcribed on-device
- No data sent to cloud services
- No internet connection required
- No API keys or accounts needed
- Perfect for sensitive or confidential content
No API Key Required
Like WhisperKit, Parakeet requires no setup:- Select Parakeet in Stenox Settings
- Download your preferred model
- Start dictating immediately
Available Models
| Model | Size | Speed | Accuracy | Languages |
|---|---|---|---|---|
| Parakeet v2 | ~2.5 GB | Very Fast | Best | English only |
| Parakeet v3 | ~2.5 GB | Very Fast | Best | 25+ languages |
Recommended: Parakeet v2 for English-only use. Parakeet v3 if you need multilingual support.
Model Selection Guide
Parakeet v2 - Best for English
Parakeet v2 - Best for English
Best for:
- English-only dictation
- Highest accuracy available
- Professional writing and technical content
Parakeet v3 - Multilingual
Parakeet v3 - Multilingual
Best for:
- Non-English languages (25+ supported)
- Multilingual dictation
- Same accuracy as v2 with language flexibility
Setup Instructions
Choose a model
Select your preferred model:
- Parakeet v2 (recommended for English)
- Parakeet v3 (multilingual)
Download the model
Click Download and wait for the model to download (~2.5 GB).The model is stored locally and optimized for CoreML on first use.
Performance
Parakeet runs on your Mac’s Apple Neural Engine for fast, efficient processing:- Processing speed: ~190x real-time on Apple Silicon (M-series chips)
- Typical latency: 2-3 seconds for 10 seconds of audio
- First use: May take a minute to compile the CoreML model (one-time)
Apple Silicon required. Parakeet uses CoreML and the Apple Neural Engine, which are only available on M1 and later Macs.
Storage Requirements
- Model size: ~2.5 GB per model
- Location: Managed by FluidAudio framework
Parakeet vs WhisperKit
| Feature | Parakeet | WhisperKit |
|---|---|---|
| Accuracy | Best (6% WER) | Good (7.4% WER) |
| Speed | Very Fast | Fast |
| Model Size | ~2.5 GB | 142 MB - 483 MB |
| Languages | English + 25+ | 100+ |
| Intel Mac | No | Yes |
| Neural Engine | Yes (CoreML) | Yes (CoreML) |
When to Use Parakeet
Best local accuracy
State-of-the-art transcription running entirely on your Mac.
Privacy is critical
Healthcare, legal, financial, or any sensitive content.
Working offline
Airplanes, remote locations, or unstable internet.
Professional use
High-accuracy transcription for professional writing and documentation.
Troubleshooting
Model download fails
Model download fails
- Check your internet connection
- Ensure you have at least 3 GB of free disk space
- Try downloading again — downloads support resuming
First transcription is slow
First transcription is slow
- The CoreML model needs to compile on first use — this is a one-time process
- Subsequent transcriptions will be much faster
- You’ll see a “Compiling model” toast while this happens
Wrong language being transcribed
Wrong language being transcribed
- For English, use Parakeet v2 (English-optimized)
- For other languages, use Parakeet v3 and set language in Profile settings
Poor accuracy for technical terms
Poor accuracy for technical terms
- Add terms to Vocabulary for custom replacements
- Parakeet v2 generally handles technical terms well for English

