Skip to main content
“og:title”: “Stenox Docs”

Fast, High-Quality Cloud Setup

Cloud providers offer the fastest processing and highest accuracy for Stenox dictation. This guide shows you the recommended configurations for optimal results.
Most recommended: Stenox Cloud — fine-tuned for Stenox, zero configuration, fastest setup. Included with Plus and Lifetime Plus.

Why Use Cloud Providers?

Ultra-fast processing

Sub-second transcription and enhancement (5-10x faster than local).

Higher accuracy

State-of-the-art models optimized for production use.

No local storage

No need to download multi-GB models.

Works on any Mac

Intel and Apple Silicon, old and new.
Best for: Fastest setup, best out-of-the-box experience Configuration:
  • Transcription: Stenox Cloud Transcription
  • Enhancement: Stenox Cloud AI Enhancement
Setup time: 2 minutes (just activate your license) Processing time: ~1-2 seconds total Availability: Plus and Lifetime Plus plans
1

Get Plus or Lifetime Plus

Visit stenox.app/pricing and choose your plan
2

Activate License

Settings → License → Sign in with your purchase email
3

Select Stenox Cloud

Settings → Transcription → Provider: Stenox Cloud Settings → AI Enhancement → Provider: Stenox Cloud
4

Start Dictating

That’s it — no API keys, no configuration, just works
Why Stenox Cloud?
  • Fine-tuned for Stenox — optimized specifically for voice dictation workflows
  • Zero configuration — no API keys to manage, no accounts to create
  • Consistent quality — we maintain and update the models for best results
  • Fast and accurate — production-grade speed and accuracy

Learn More

Full Stenox Cloud details

Option 2: Maximum Free Credits (BYOK)

Best for: Getting started free, maximum free usage Configuration:
  • Transcription: DeepGram Nova-3 ($200 free credits)
  • Enhancement: Google Gemini 2.5 Flash (1,500/day free)
Setup time: 10 minutes Processing time: ~1-2 seconds total Free usage: 6-12 months typical
1

Get DeepGram API key

  1. Visit console.deepgram.com
  2. Sign up (no credit card required)
  3. Create API key
  4. Get $200 in free credits
2

Configure DeepGram in Stenox

  1. Settings → Transcription tab
  2. Provider: DeepGram
  3. Paste API key
  4. Model: Nova-3 (best accuracy)
3

Get Google Gemini API key

  1. Visit aistudio.google.com
  2. Sign in with Google (no credit card)
  3. Create API key
  4. Get 1,500 free requests/day
4

Configure Gemini in Stenox

  1. Settings → AI Enhancement tab
  2. Provider: Google Gemini
  3. Paste API key
  4. Model: Gemini 2.5 Flash (fastest)
Get detailed setup instructions →

Option 3: Groq Stack (Single API Key)

Best for: Simplicity, one account for everything Configuration:
  • Transcription: Groq Whisper v3-turbo
  • Enhancement: Groq Llama 3.3 70B or DeepSeek R1
Setup time: 5 minutes Processing time: ~1-2 seconds total Free usage: Ongoing free tier with rate limits
1

Get Groq API key

  1. Visit console.groq.com
  2. Sign up (no credit card required)
  3. Create API key
  4. One key for both transcription and enhancement!
2

Configure Groq Transcription

  1. Settings → Transcription tab
  2. Provider: Groq
  3. Paste API key
  4. Model: whisper-large-v3-turbo
3

Configure Groq Enhancement

  1. Settings → AI Enhancement tab
  2. Provider: Groq
  3. Paste same API key
  4. Model: Llama 3.3 70B Versatile
Benefits:
  • Single API key management
  • Ultra-fast LPU inference
  • Ongoing free tier (doesn’t expire)
  • Simple setup
Get detailed setup instructions →

Option 4: Best Quality (Paid BYOK)

Best for: Professional use, maximum accuracy Configuration:
  • Transcription: DeepGram Nova-3 or AssemblyAI Universal-1
  • Enhancement: Google Gemini 2.5 Pro
Cost: ~$0.75-1.50 per hour of dictation Processing time: ~1-2 seconds total When to use:
  • Professional transcription services
  • Critical accuracy requirements
  • Business or enterprise use
  • After free credits are exhausted

Performance Comparison

ConfigurationSpeedAccuracySetupCostBest For
Stenox Cloud⚡⚡⚡⭐⭐⭐⭐⭐InstantPlus/Lifetime PlusBest experience, zero config
DeepGram + Gemini⚡⚡⚡⭐⭐⭐⭐10 min$$ after freeMaximum free credits
Groq Stack⚡⚡⚡⭐⭐⭐⭐5 minFree tierOne API key simplicity
AssemblyAI + Gemini⚡⚡⚡⭐⭐⭐⭐10 min$Accent optimization
Local (WhisperKit + MLX)⭐⭐⭐5 minFree foreverPrivacy, offline

Detailed Provider Configurations

DeepGram Nova-3 Settings

Recommended configuration:
SettingValueWhy
ModelNova-3Latest, best accuracy
LanguageEnglish (or auto-detect)Optimize for your language
StreamingEnabled (automatic)Fastest results
PunctuationEnabled (automatic)Auto-punctuation
In Stenox:
  • Provider: DeepGram
  • Model: Nova-3 or Nova-3-English
  • Language: English or Auto-detect
DeepGram full guide →

Google Gemini 2.5 Flash Settings

Recommended configuration:
SettingValueWhy
ModelGemini 2.5 FlashFastest with excellent quality
Custom promptDefault or ProfessionalBased on use case
TemperatureDefaultStenox handles this
In Stenox:
  • Provider: Google Gemini
  • Model: Gemini 2.5 Flash
  • Custom prompt: (optional, see below)
Google Gemini full guide →

Enhancement Prompts

Fine-tuned out of the box. Both Stenox Cloud and local providers come with fine-tuned enhancements that we continue to refine — no configuration needed for most users.
For BYOK providers, you can configure custom enhancement behaviors:
Fix grammar and punctuation errors.
Keep the original meaning and tone unchanged.
Use for: General dictation, notes

Multi-Profile Cloud Setup

For maximum flexibility, create profiles for different scenarios:

Profile Strategy

1

Work Email Profile

  • Name: “Work Email”
  • Transcription: DeepGram Nova-3
  • Enhancement: Gemini 2.5 Flash
  • Custom prompt: Professional email tone
  • Auto-activate: When Gmail or Outlook is active
2

Quick Notes Profile

  • Name: “Quick Notes”
  • Transcription: Groq Whisper (fastest)
  • Enhancement: None (skip for speed)
  • Auto-activate: When Notes app is active
3

Documentation Profile

  • Name: “Documentation”
  • Transcription: DeepGram Nova-3
  • Enhancement: Gemini 2.5 Flash
  • Custom prompt: Technical documentation
  • Auto-activate: When VSCode or similar is active
4

Fallback Local Profile

  • Name: “Offline”
  • Transcription: WhisperKit
  • Enhancement: MLX or None
  • Manual activation: When internet unavailable
Learn about Profiles →

Usage Optimization

Maximize Free Credits

DeepGram ($200 credits):
  • Use for important dictation (work emails, documents)
  • Switch to Groq or local for casual notes
  • Track usage in DeepGram console
Google Gemini (1,500/day):
  • More than enough for most users
  • If you hit limit, create “no enhancement” profile
  • Resets daily at midnight PST
Groq (free tier):
  • Use as primary option — no API costs
  • Rate limits refresh quickly
  • Good for privacy-conscious users who prefer not to create accounts

Cost Control

After free tiers, control costs:
Groq’s ongoing free tier is sufficient for most personal use. Only upgrade if you consistently hit rate limits.
Create a profile without AI enhancement for quick personal notes. Saves on Gemini API calls.
If you dictate heavily (2+ hours/day), switch to WhisperKit + MLX to avoid costs.
AssemblyAI costs only $0.0009/min after free credits - very affordable for continued use.

Troubleshooting Cloud Setup

  • Check your internet connection speed
  • Try switching to different model (e.g., Flux instead of Nova-3)
  • Check provider status pages for outages
  • Test with local provider to isolate issue
  • Ensure you’re using Nova-3 or latest models
  • Check microphone input device (Settings → Audio tab)
  • Reduce background noise
  • Speak more clearly or use better microphone
  • Try AssemblyAI if you have strong accent
  • Verify API keys are correct (no extra spaces)
  • Check API key status in provider dashboards
  • Ensure you have remaining credits/quota
  • Try creating new API keys
  • Create local fallback profile (WhisperKit + MLX)
  • Set up multiple cloud providers and rotate
  • Consider upgrading to paid tier if needed

Internet Connection Requirements

Minimum requirements:
  • Speed: 1 Mbps upload, 1 Mbps download
  • Latency: < 100ms preferred
  • Stability: Consistent connection (not intermittent)
Optimal:
  • Speed: 5+ Mbps upload/download
  • Latency: < 50ms
  • Type: Wi-Fi or Ethernet (not cellular for best results)
Cloud processing sends ~1 MB per minute of audio. Most home internet connections are more than sufficient.

Next Steps

Best setup: Stenox Cloud — fine-tuned for Stenox, zero configuration, just works. Get started at stenox.app/pricing.