Overview
Groq provides blazing-fast inference for open-source large language models using their LPU (Language Processing Unit) architecture. This enables some of the fastest AI enhancement available with excellent quality. Key Benefits:- ⚡ Blazing fast - Sub-second enhancement with Groq’s LPU
- 🎯 High quality - Llama 3.3/4, Qwen 3, DeepSeek R1 models
- 🔄 Same API key - Use for both Groq Whisper and Groq LLMs
- 🎁 Free tier - Generous free usage with rate limits
Free Tier
Free Tier Limits
Groq’s free tier varies by model. Generally:- Rate limits instead of credit limits
- Tokens per minute limits per model
- Sufficient for personal use and many professional use cases
- Check console.groq.com for current limits
When you hit rate limits, either wait a few minutes or use a fallback profile with MLX (local).
Getting Your Free API Key
Visit Groq Console
Go to console.groq.com in your browser.
Sign up for free
Click Sign Up and create an account with:
- Email and password
- Or sign in with Google/GitHub
Create API key
Once logged in:
- Go to API Keys section
- Click Create API Key
- Give it a name (e.g., “Stenox”)
- Copy the key immediately (can’t view again!)
One API key for everything: This same key works for Groq Whisper (transcription) and Groq LLMs (enhancement). Configure once!
Available Models
Groq offers multiple open-source LLMs optimized for their LPU hardware:Llama 3.3 70B Versatile (Recommended)
Best all-around model:- Excellent grammar correction and formatting
- Balanced speed and quality
- Versatile for all use cases
- Good context understanding
DeepSeek R1 Distill Llama 70B
Reasoning-focused model:- Excellent for complex edits
- Strong logical reasoning
- Very good at restructuring
- Slightly slower than Llama 3.3
Llama 4 Scout 17B
Faster, smaller model:- Faster inference than 70B models
- Good quality for size
- Lower resource usage
- Good for quick enhancement
Qwen 3 72B
Multilingual specialist:- Excellent for non-English languages
- Good grammar correction
- Strong formatting abilities
Configuration in Stenox
What Groq LLMs Do
Grammar Correction
- Comprehensive grammar fixes
- Subject-verb agreement
- Tense consistency
- Article and preposition corrections
- Complex sentence structure improvements
- Before: “me and my team was working on the project all day yesterday and we finally finished it”
- After: “My team and I were working on the project all day yesterday, and we finally finished it.”
Formatting & Punctuation
- Professional punctuation
- Proper capitalization
- Number and date formatting
- List structuring
- Paragraph organization
- Before: “our meeting is scheduled for march 15th at 2pm please bring the q4 reports”
- After: “Our meeting is scheduled for March 15th at 2 PM. Please bring the Q4 reports.”
Tone Adjustment
With custom prompts, Groq LLMs excel at tone modification:- Professional tone - Formal, business-appropriate
- Casual tone - Friendly, conversational
- Technical tone - Precise, documentation-style
- Persuasive tone - Engaging, convincing
Advanced Capabilities
DeepSeek R1 especially excels at:- Restructuring for clarity
- Making implicit information explicit
- Improving logical flow
- Complex reasoning-based edits
Performance
Expected enhancement times with Groq LLMs:| Transcription Length | Enhancement Time |
|---|---|
| Short (1-2 sentences) | < 0.5 seconds |
| Medium (paragraph) | < 1 second |
| Long (multiple paragraphs) | 1-2 seconds |
Groq’s LPU architecture makes enhancement nearly instantaneous, even for large models like 70B parameter LLMs.
Custom Prompts
Configure how Groq LLMs enhance your text in Profile settings:- Default
- Professional Email
- Technical Documentation
- Creative Writing
Usage Tracking
Monitor your Groq usage:- Visit console.groq.com
- Go to Usage section
- View:
- API calls made
- Tokens consumed
- Rate limit status
- Current tier
Stenox does not track Groq usage. Check console.groq.com for accurate statistics.
Privacy Considerations
What Groq sees:- Your transcribed text is sent to Groq servers
- Text may be logged for service improvement
- Data is encrypted in transit (HTTPS)
- Review Groq’s privacy policy for details
- Use MLX (local) for 100% offline enhancement
When to Use Groq LLMs
Ultra-fast processing
LPU-accelerated inference for near-instant results.
High quality
70B parameter models with excellent capabilities.
Groq ecosystem
Same API key as Groq Whisper - seamless integration.
Free tier
Generous free usage for personal and professional use.
Groq Ecosystem: Full Pipeline
Use Groq for both transcription and enhancement:Complete Groq Setup
Benefits:
- Single account management
- Unified billing/usage tracking
- Ultra-fast end-to-end processing (~2 seconds total)
- Simple configuration
Troubleshooting
API key not working
API key not working
- Ensure you copied the complete key (no spaces)
- Verify account is active (check email verification)
- Try creating a new API key
- Check for typos
Rate limit exceeded
Rate limit exceeded
- You’ve hit free tier rate limits
- Wait a few minutes and try again
- Use MLX (local) as backup
- Create a profile without enhancement for quick notes
- Upgrade to paid tier for higher limits
Enhancement fails
Enhancement fails
- Check internet connection
- Verify API key is valid in Groq console
- Check Groq status page for service issues
- Try a shorter test enhancement
Quality not as expected
Quality not as expected
- Try DeepSeek R1 for better reasoning
- Adjust custom prompt in Profile settings
- Provide more specific enhancement instructions
- Ensure transcription quality is good (can’t fix bad transcription)
Pricing After Free Tier
Groq offers competitive pay-as-you-go pricing when you exceed free tier:- Pricing varies by model - Check console.groq.com
- Very affordable - Among the cheapest cloud LLM options
- No commitment - Pay only for usage
Most users stay within free tier limits. Check Groq console for current pricing.
Model Comparison
| Model | Speed | Quality | Best For |
|---|---|---|---|
| Llama 3.3 70B | Very Fast | Excellent | General use, balanced |
| DeepSeek R1 70B | Fast | Superior | Complex edits, reasoning |
| Llama 4 Scout 17B | Fastest | Very Good | Speed-critical |
| Qwen 3 72B | Very Fast | Excellent | Multilingual |

