Back to Developer ZoneOfficial Pricing Official Pricing Official Pricing
Mistral Models
Explore all 3 models from Mistral with detailed pricing, pros & cons, and developer recommendations.
3
Models
$0.400
Lowest Input
131K
Max Context
2
Quality Tiers
Quick Recommendations
Best Value: Mistral Medium 3 ($0.400/1M)
Best Quality: Mistral Large 2
Mistral Large 2
FlagshipMultilingual, complex tasks
When to use: Top pick for multilingual enterprise apps, especially EU-based deployments.
Upgrade Highlights
- ◆MMLU: 84.0% — competitive with GPT-4 class models
- ◆Fine-tuning available — customize for domain-specific tasks
- ◆EU-based (GDPR compliant) — only major flagship with EU sovereignty
- ◆12 languages optimized — best-in-class for French, German, Spanish
- ◆128K context is smaller vs 1M competitors — trade-off for EU compliance
Input Price
$2.00
per 1M tokens
Output Price
$6.00
per 1M tokens
Cached Input
—
per 1M tokens
Batch Input
—
per 1M tokens
Context Window: 131K
Max Output: 8,192 tokens
Knowledge Cutoff: 2024-07
VisionFunction CallingFine-tuningJSON ModeFree Tier
Pros
- Excellent multilingual (French, German, etc.)
- Fine-tuning available
- EU-based (GDPR friendly)
Cons
- No vision
- 128K context is small vs competitors
- No cached/batch pricing
Performance
Output Speed~55 tok/s
Rate Limit3,000 RPM
Multimodal
Image InputImage OutputAudio InputAudio Output
Benchmarks
MMLU
84.0%
HumanEval
82.0%
MATH
65.0%
Mistral Medium 3
Mid-tierBalanced multilingual
When to use: Cost-effective multilingual + vision for European market applications.
Upgrade Highlights
- ◆Vision added at mid-tier — multimodal capability at $0.40/M input
- ◆5x cheaper than Mistral Large ($0.40 vs $2/M) for lighter tasks
- ◆EU-based infrastructure — GDPR compliant for European data
- ◆128K context — sufficient for most enterprise document processing
- ◆No fine-tuning — upgrade to Large 2 for custom model training
Input Price
$0.400
per 1M tokens
Output Price
$2.00
per 1M tokens
Cached Input
—
per 1M tokens
Batch Input
—
per 1M tokens
Context Window: 131K
Max Output: 8,192 tokens
Knowledge Cutoff: 2024-07
VisionFunction CallingFine-tuningJSON ModeFree Tier
Pros
- Vision at mid-tier price
- Good multilingual balance
- EU-based
Cons
- No fine-tuning
- No cached/batch pricing
- Smaller context than competitors
Performance
Output Speed~75 tok/s
Rate Limit5,000 RPM
Multimodal
Image InputImage OutputAudio InputAudio Output
Benchmarks
MMLU
80.5%
HumanEval
78.0%
Mixtral 8x22B
Mid-tierOpen-weight, high throughput
When to use: For self-hosting or fine-tuning on domain-specific data with high throughput needs.
Upgrade Highlights
- ◆Open-weight MoE: 8x22B params, only 39B active per token — high throughput
- ◆Fine-tunable — full weight access for domain adaptation
- ◆Function calling + JSON mode — enterprise-ready tool integration
- ◆65K context — smaller than newer models but sufficient for most tasks
- ◆Self-hostable — no per-token cost when running on own infrastructure
Input Price
$0.900
per 1M tokens
Output Price
$2.70
per 1M tokens
Cached Input
—
per 1M tokens
Batch Input
—
per 1M tokens
Context Window: 66K
Max Output: 4,096 tokens
Knowledge Cutoff: 2024-01
VisionFunction CallingFine-tuningJSON ModeFree Tier
Pros
- Open-weight MoE architecture
- Fine-tunable
- High throughput via sparse activation
Cons
- Only 65K context
- No vision
- Older knowledge cutoff (2024-01)
Performance
Output Speed~85 tok/s
Rate Limit—
Multimodal
Image InputImage OutputAudio InputAudio Output
Benchmarks
MMLU
77.8%
HumanEval
75.5%
MATH
58.0%
Side-by-Side Comparison
| Model | Tier | Input | Output | Context |
|---|---|---|---|---|
| Mistral Large 2 | Flagship | $2.00 | $6.00 | 131K |
| Mistral Medium 3 | Mid-tier | $0.400 | $2.00 | 131K |
| Mixtral 8x22B | Mid-tier | $0.900 | $2.70 | 66K |