Overview
Groq provides blazing-fast inference for open-source large language models using their LPU (Language Processing Unit) architecture. This enables some of the fastest AI enhancement available with excellent quality. Key Benefits:- ⚡ Blazing fast - Sub-second enhancement with Groq’s LPU
- 🎯 High quality - Llama 3.3/4, Qwen 3, DeepSeek R1 models
- 🔄 Same API key - Use for both Groq Whisper and Groq LLMs
- 🎁 Free tier - Generous free usage with rate limits
Free Tier
Free Tier Limits
Groq’s free tier varies by model. Generally:- Rate limits instead of credit limits
- Tokens per minute limits per model
- Sufficient for personal use and many professional use cases
- Check console.groq.com for current limits
When you hit rate limits, either wait a few minutes or use a fallback profile with MLX (local).
Getting Your Free API Key
1
Visit Groq Console
Go to console.groq.com in your browser.
2
Sign up for free
Click Sign Up and create an account with:
- Email and password
- Or sign in with Google/GitHub
3
Verify your email
Check your email for verification link from Groq and click it.
4
Create API key
Once logged in:
- Go to API Keys section
- Click Create API Key
- Give it a name (e.g., “Stenox”)
- Copy the key immediately (can’t view again!)
5
Add key to Stenox
- Open Stenox Settings
- Go to AI Enhancement tab
- Select Groq as provider
- Paste your API key
- Choose your model (Llama 3.3 70B or DeepSeek R1 recommended)
- Click Save
One API key for everything: This same key works for Groq Whisper (transcription) and Groq LLMs (enhancement). Configure once!
Available Models
Groq offers multiple open-source LLMs optimized for their LPU hardware:Llama 3.3 70B Versatile (Recommended)
Best all-around model:- Excellent grammar correction and formatting
- Balanced speed and quality
- Versatile for all use cases
- Good context understanding
DeepSeek R1 Distill Llama 70B
Reasoning-focused model:- Excellent for complex edits
- Strong logical reasoning
- Very good at restructuring
- Slightly slower than Llama 3.3
Llama 4 Scout 17B
Faster, smaller model:- Faster inference than 70B models
- Good quality for size
- Lower resource usage
- Good for quick enhancement
Qwen 3 72B
Multilingual specialist:- Excellent for non-English languages
- Good grammar correction
- Strong formatting abilities
Configuration in Stenox
1
Open Stenox Settings
Click Stenox menu bar icon → Settings
2
Go to AI Enhancement tab
Select the AI Enhancement tab
3
Select Groq
Choose Groq from the provider dropdown
4
Enter API key
Paste your API key from console.groq.com(Same key as Groq Whisper if you use it!)
5
Choose model
Select Llama 3.3 70B Versatile (recommended)
6
Optional: Custom prompt
Configure enhancement behavior (see Custom Prompts section)
7
Save
Click Save to activate Groq LLMs
What Groq LLMs Do
Grammar Correction
- Comprehensive grammar fixes
- Subject-verb agreement
- Tense consistency
- Article and preposition corrections
- Complex sentence structure improvements
- Before: “me and my team was working on the project all day yesterday and we finally finished it”
- After: “My team and I were working on the project all day yesterday, and we finally finished it.”
Formatting & Punctuation
- Professional punctuation
- Proper capitalization
- Number and date formatting
- List structuring
- Paragraph organization
- Before: “our meeting is scheduled for march 15th at 2pm please bring the q4 reports”
- After: “Our meeting is scheduled for March 15th at 2 PM. Please bring the Q4 reports.”
Tone Adjustment
With custom prompts, Groq LLMs excel at tone modification:- Professional tone - Formal, business-appropriate
- Casual tone - Friendly, conversational
- Technical tone - Precise, documentation-style
- Persuasive tone - Engaging, convincing
Advanced Capabilities
DeepSeek R1 especially excels at:- Restructuring for clarity
- Making implicit information explicit
- Improving logical flow
- Complex reasoning-based edits
Performance
Expected enhancement times with Groq LLMs:| Transcription Length | Enhancement Time |
|---|---|
| Short (1-2 sentences) | < 0.5 seconds |
| Medium (paragraph) | < 1 second |
| Long (multiple paragraphs) | 1-2 seconds |
Groq’s LPU architecture makes enhancement nearly instantaneous, even for large models like 70B parameter LLMs.
Custom Prompts
Configure how Groq LLMs enhance your text in Profile settings:- Default
- Professional Email
- Technical Documentation
- Creative Writing
Usage Tracking
Monitor your Groq usage:- Visit console.groq.com
- Go to Usage section
- View:
- API calls made
- Tokens consumed
- Rate limit status
- Current tier
Stenox does not track Groq usage. Check console.groq.com for accurate statistics.
Privacy Considerations
What Groq sees:- Your transcribed text is sent to Groq servers
- Text may be logged for service improvement
- Data is encrypted in transit (HTTPS)
- Review Groq’s privacy policy for details
- Use MLX (local) for 100% offline enhancement
When to Use Groq LLMs
Ultra-fast processing
LPU-accelerated inference for near-instant results.
High quality
70B parameter models with excellent capabilities.
Groq ecosystem
Same API key as Groq Whisper - seamless integration.
Free tier
Generous free usage for personal and professional use.
Groq Ecosystem: Full Pipeline
Use Groq for both transcription and enhancement:Complete Groq Setup
1
Sign up once
Create one Groq account at console.groq.com
2
One API key
Generate a single API key for all Groq services
3
Configure transcription
Use Groq Whisper (whisper-large-v3-turbo) for transcription
4
Configure enhancement
Use Groq LLMs (Llama 3.3 70B or DeepSeek R1) for enhancement
5
Done!
One provider, one API key, complete pipeline
- Single account management
- Unified billing/usage tracking
- Ultra-fast end-to-end processing (~2 seconds total)
- Simple configuration
Troubleshooting
API key not working
API key not working
- Ensure you copied the complete key (no spaces)
- Verify account is active (check email verification)
- Try creating a new API key
- Check for typos
Rate limit exceeded
Rate limit exceeded
- You’ve hit free tier rate limits
- Wait a few minutes and try again
- Use MLX (local) as backup
- Create a profile without enhancement for quick notes
- Upgrade to paid tier for higher limits
Enhancement fails
Enhancement fails
- Check internet connection
- Verify API key is valid in Groq console
- Check Groq status page for service issues
- Try a shorter test enhancement
Quality not as expected
Quality not as expected
- Try DeepSeek R1 for better reasoning
- Adjust custom prompt in Profile settings
- Provide more specific enhancement instructions
- Ensure transcription quality is good (can’t fix bad transcription)
Pricing After Free Tier
Groq offers competitive pay-as-you-go pricing when you exceed free tier:- Pricing varies by model - Check console.groq.com
- Very affordable - Among the cheapest cloud LLM options
- No commitment - Pay only for usage
Most users stay within free tier limits. Check Groq console for current pricing.
Model Comparison
| Model | Speed | Quality | Best For |
|---|---|---|---|
| Llama 3.3 70B | Very Fast | Excellent | General use, balanced |
| DeepSeek R1 70B | Fast | Superior | Complex edits, reasoning |
| Llama 4 Scout 17B | Fastest | Very Good | Speed-critical |
| Qwen 3 72B | Very Fast | Excellent | Multilingual |

