Back to Developer Zone

Anthropic Models

Explore all 6 models from Anthropic with detailed pricing, pros & cons, and developer recommendations.

6
Models
$0.800
Lowest Input
1M
Max Context
2
Quality Tiers

Quick Recommendations

Best Value: Claude 3.5 Haiku ($0.800/1M)
Best Quality: Claude Opus 4.8

Claude Opus 4.8

Flagship

Frontier coding, AI agents

Official Pricing

When to use: When you need the absolute latest Anthropic intelligence for complex coding and agent workflows.

Upgrade Highlights

  • Latest Anthropic flagship: knowledge cutoff 2025-06 (2 months newer than 4.7)
  • Fast mode available for premium speed — 2x faster responses
  • 1M context window (up from 200K in Claude 4 Opus)
  • Same $5/$25 pricing as Opus 4.7 — more capability at same cost
  • Improved agentic coding: better multi-step tool orchestration
Input Price
$5.00
per 1M tokens
Output Price
$25.00
per 1M tokens
Cached Input
$0.500
per 1M tokens
Batch Input
per 1M tokens
Context Window: 1M
Max Output: 32,000 tokens
Knowledge Cutoff: 2025-06
VisionFunction CallingFine-tuningJSON ModeFree Tier

Pros

  • Latest Anthropic flagship with 1M context
  • Fast mode available for premium speed
  • Best-in-class agentic coding

Cons

  • Expensive output at $25/M tokens
  • No batch API
  • No fine-tuning

Performance

Output Speed~65 tok/s
Rate Limit4,000 RPM

Multimodal

Image InputImage OutputAudio InputAudio Output

Benchmarks

SWE-bench Verified
79.8%
MMLU
89.8%
GPQA
78.5%

Claude Opus 4.7

Flagship

Agentic coding, complex analysis

Official Pricing

When to use: Top choice for long-horizon coding agents and complex analytical workloads.

Upgrade Highlights

  • SWE-Bench Pro: 64.3% — leads all models on real-world coding
  • 1M context window — 5x increase over Claude 4 Opus's 200K
  • Same $5/$25 pricing as Opus 4.6 with better performance
  • Knowledge cutoff: 2025-04 — 1 month newer than Claude 4
  • Best-in-class for long-horizon multi-step coding agents
Input Price
$5.00
per 1M tokens
Output Price
$25.00
per 1M tokens
Cached Input
$0.500
per 1M tokens
Batch Input
per 1M tokens
Context Window: 1M
Max Output: 32,000 tokens
Knowledge Cutoff: 2025-04
VisionFunction CallingFine-tuningJSON ModeFree Tier

Pros

  • 1M context window
  • 64.3% on SWE-Bench Pro (leads competition)
  • Same price as Opus 4.6 with better performance

Cons

  • Still expensive at $5/$25
  • No batch API
  • No fine-tuning

Performance

Output Speed~60 tok/s
Rate Limit4,000 RPM

Multimodal

Image InputImage OutputAudio InputAudio Output

Benchmarks

SWE-bench Pro
64.3%
MMLU
89.5%
GPQA
77.0%

Agents Using This Model

1

Claude Haiku 4.5

Lite

Fast, cost-efficient AI tasks

Official Pricing

When to use: Best Anthropic value for production chatbots, coding assistants, and agent tasks.

Upgrade Highlights

  • Matches Sonnet 4 coding performance at 1/3 the price
  • Vision support at lite tier — first Haiku with multimodal capability
  • 200K context window (same as Claude 4 Opus/Sonnet)
  • Knowledge cutoff: 2025-02 — newest at the Haiku tier
  • Price: $1/$5 — 5x more than 3.5 Haiku but 3x quality improvement
Input Price
$1.00
per 1M tokens
Output Price
$5.00
per 1M tokens
Cached Input
$0.100
per 1M tokens
Batch Input
per 1M tokens
Context Window: 200K
Max Output: 8,192 tokens
Knowledge Cutoff: 2025-02
VisionFunction CallingFine-tuningJSON ModeFree Tier

Pros

  • Matches Sonnet 4 performance on coding tasks
  • Vision support at lite tier
  • 200K context window

Cons

  • 5x more expensive than 3.5 Haiku
  • 8K max output is limiting
  • No fine-tuning

Performance

Output Speed~100 tok/s
Rate Limit8,000 RPM

Multimodal

Image InputImage OutputAudio InputAudio Output

Benchmarks

SWE-bench Verified
58.0%
MMLU
85.0%
HumanEval
84.5%

Claude 4 Opus

Flagship

Agentic coding, deep analysis

Official Pricing

When to use: When you need the absolute best for complex agentic coding and deep analytical work.

Upgrade Highlights

  • SWE-bench Verified: 72.7% — highest agentic coding score at launch
  • 200K context with 32K max output — 4x Claude 3.5 Sonnet's 8K output
  • Batch API available: $7.50/$37.50 — 50% savings for async workloads
  • Knowledge cutoff: 2025-03 — 6 months newer than 3.5 Sonnet
  • Premium pricing ($15/$75) reflects highest intelligence tier
Input Price
$15.00
per 1M tokens
Output Price
$75.00
per 1M tokens
Cached Input
$1.50
per 1M tokens
Batch Input
$7.50
per 1M tokens
Context Window: 200K
Max Output: 32,000 tokens
Knowledge Cutoff: 2025-03
VisionFunction CallingFine-tuningJSON ModeFree Tier

Pros

  • Best agentic coding model available
  • Superior deep analysis
  • Latest knowledge cutoff (2025-03)

Cons

  • Most expensive model in market ($15/$75)
  • No fine-tuning
  • Slower than Sonnet

Performance

Output Speed~45 tok/s
Rate Limit2,000 RPM

Multimodal

Image InputImage OutputAudio InputAudio Output

Benchmarks

SWE-bench Verified
72.7%
MMLU
89.2%
GPQA
76.8%
MATH
78.0%

Claude 4 Sonnet

Flagship

Coding, writing, analysis

Official Pricing

When to use: Best balance of quality and cost for writing, coding, and analysis production apps.

Upgrade Highlights

  • SWE-bench: 63.8% — competitive with flagships at 1/5 the price
  • Batch API: $1.50/$7.50 — 50% savings for non-real-time workloads
  • 200K context + 16K output — 2x output of Claude 3.5 Sonnet
  • Top-tier writing quality — best for content generation and editing
  • Price $3/$15 — replaced Claude 3.5 Sonnet as the default choice
Input Price
$3.00
per 1M tokens
Output Price
$15.00
per 1M tokens
Cached Input
$0.300
per 1M tokens
Batch Input
$1.50
per 1M tokens
Context Window: 200K
Max Output: 16,384 tokens
Knowledge Cutoff: 2025-03
VisionFunction CallingFine-tuningJSON ModeFree Tier

Pros

  • Top-tier writing quality
  • 5x cheaper than Opus
  • Excellent coding with 200K context

Cons

  • Lower max output than Opus
  • No fine-tuning
  • Slower than GPT-4o mini

Performance

Output Speed~70 tok/s
Rate Limit4,000 RPM

Multimodal

Image InputImage OutputAudio InputAudio Output

Benchmarks

SWE-bench Verified
63.8%
MMLU
88.0%
HumanEval
91.0%
MATH
72.5%

Claude 3.5 Haiku

Lite

Fast, affordable tasks

Official Pricing

When to use: For high-throughput text tasks where you want Anthropic quality on a budget.

Upgrade Highlights

  • First budget Anthropic model with 200K context (was 100K in Claude 3 Haiku)
  • Prompt caching: $0.08/M — 90% savings for repeated system prompts
  • Batch API: $0.40/$2 — 50% savings for async processing
  • Claude's nuanced writing style at 1/4 the price of Sonnet
  • No vision — text-only; upgrade to Haiku 4.5 for multimodal
Input Price
$0.800
per 1M tokens
Output Price
$4.00
per 1M tokens
Cached Input
$0.080
per 1M tokens
Batch Input
$0.400
per 1M tokens
Context Window: 200K
Max Output: 8,192 tokens
Knowledge Cutoff: 2024-07
VisionFunction CallingFine-tuningJSON ModeFree Tier

Pros

  • 200K context at budget price
  • Claude's nuanced style at low cost
  • Good prompt caching savings

Cons

  • No vision support
  • No fine-tuning
  • 8K max output is limiting

Performance

Output Speed~95 tok/s
Rate Limit8,000 RPM

Multimodal

Image InputImage OutputAudio InputAudio Output

Benchmarks

MMLU
83.0%
HumanEval
82.0%
MATH
63.5%

Side-by-Side Comparison

ModelTierInputOutputContext
Claude Opus 4.8Flagship$5.00$25.001M
Claude Opus 4.7Flagship$5.00$25.001M
Claude Haiku 4.5Lite$1.00$5.00200K
Claude 4 OpusFlagship$15.00$75.00200K
Claude 4 SonnetFlagship$3.00$15.00200K
Claude 3.5 HaikuLite$0.800$4.00200K