返回开发者专区

DeepSeek 模型

探索 DeepSeek 的所有 4 个模型,包括详细定价、优缺点和开发者推荐。

4
模型
$0.140
最低输入价格
1M
最大上下文
2
质量层级

快速推荐

最佳性价比: DeepSeek V4 Flash ($0.140/1M)
最佳质量: DeepSeek V4 Flash
最佳推理: DeepSeek V4 Pro

DeepSeek V4 Pro

Reasoning

Deep reasoning, complex analysis

官方定价

适用场景: Best budget reasoning model with massive output capacity for math, logic, and analysis.

核心升级

  • 384K max output — largest in the market (3x GPT-5.5's 128K)
  • 1M context window — 16x increase over DeepSeek R1's 65K
  • 99% cheaper than GPT-5.5 for reasoning ($0.435 vs $5/M input)
  • Cached input: $0.003625/M — 99.2% savings for repeated prefixes
  • Thinking mode: toggleable deep reasoning without extra cost
输入价格
$0.435
per 1M tokens
输出价格
$0.870
per 1M tokens
缓存输入
$0.0036
per 1M tokens
批量输入
per 1M tokens
上下文窗口: 1M
最大输出: 384,000 tokens
知识截止日期: 2025-06
视觉函数调用微调JSON 模式

优点

  • 384K max output — largest available
  • 1M context window
  • 99% cheaper than GPT-5.5 for reasoning

缺点

  • No vision support
  • No fine-tuning
  • Thinking mode adds token overhead

性能

输出速度~30 tok/s
速率限制2,000 RPM

多模态能力

图像输入图像输出音频输入音频输出

基准测试

MMLU
88.0%
AIME 2024
85.0%
SWE-bench Verified
71.0%
MATH
82.5%

DeepSeek V4 Flash

Flagship

Ultra-cheap coding, general tasks

官方定价

适用场景: Best for high-volume coding and general tasks where cost is the primary concern.

核心升级

  • Cheapest LLM available: $0.14/M input, $0.28/M output
  • 1M context + 384K output — same capacity as V4 Pro at 1/3 the price
  • Dual mode: thinking and non-thinking — switch based on task complexity
  • Cached input: $0.0028/M — 98% savings for system prompts
  • No vision — text-only; use GPT-4.1 or Gemini for multimodal needs
输入价格
$0.140
per 1M tokens
输出价格
$0.280
per 1M tokens
缓存输入
$0.0028
per 1M tokens
批量输入
per 1M tokens
上下文窗口: 1M
最大输出: 384,000 tokens
知识截止日期: 2025-06
视觉函数调用微调JSON 模式

优点

  • Cheapest LLM available at $0.14/M input
  • 1M context + 384K output
  • Supports both thinking and non-thinking modes

缺点

  • No vision support
  • Quality below GPT-5.4 for complex tasks
  • No fine-tuning

性能

输出速度~60 tok/s
速率限制5,000 RPM

多模态能力

图像输入图像输出音频输入音频输出

基准测试

MMLU
84.5%
HumanEval
86.0%
SWE-bench Verified
55.0%

DeepSeek V3

Flagship

Coding, math, reasoning

官方定价

适用场景: Best budget flagship for coding-heavy workloads. 10x cheaper than GPT-4.1 for comparable quality.

核心升级

  • 10x cheaper than GPT-4.1 ($0.27 vs $2/M input) for comparable coding
  • Fine-tuning available — customize for domain-specific coding tasks
  • Prompt caching: $0.07/M — 74% savings for repeated system prompts
  • 671B MoE params with 37B active — flagship intelligence at budget price
  • 65K context is small vs 1M competitors — upgrade to V4 for long context
输入价格
$0.270
per 1M tokens
输出价格
$1.10
per 1M tokens
缓存输入
$0.070
per 1M tokens
批量输入
per 1M tokens
上下文窗口: 66K
最大输出: 8,192 tokens
知识截止日期: 2024-07
视觉函数调用微调JSON 模式

优点

  • Extremely cheap for flagship quality
  • Excellent coding & math
  • Fine-tuning + caching available

缺点

  • Only 65K context
  • No vision
  • Limited function calling reliability

性能

输出速度~50 tok/s
速率限制3,000 RPM

多模态能力

图像输入图像输出音频输入音频输出

基准测试

MMLU
87.1%
SWE-bench Verified
50.0%
MATH
75.0%
HumanEval
89.0%

使用此模型的智能体

2

DeepSeek R1

Reasoning

Deep reasoning, math proofs

官方定价

适用场景: Budget reasoning model — use when o3 cost is prohibitive for math/logic pipelines.

核心升级

  • 4x cheaper than o3 ($0.55 vs $2/M input) for comparable reasoning
  • AIME 2024: 79.8% — competitive with o1 on math benchmarks
  • Prompt caching: $0.14/M — 75% savings for repeated prefixes
  • Open-source reasoning model — self-host for unlimited usage
  • No function calling or vision — pure reasoning; upgrade to V4 Pro for tools
输入价格
$0.550
per 1M tokens
输出价格
$2.19
per 1M tokens
缓存输入
$0.140
per 1M tokens
批量输入
per 1M tokens
上下文窗口: 66K
最大输出: 8,192 tokens
知识截止日期: 2024-07
视觉函数调用微调JSON 模式

优点

  • 4x cheaper than o3 for reasoning
  • Strong math & logic
  • Good prompt caching savings

缺点

  • No function calling
  • No vision or fine-tuning
  • Only 65K context

性能

输出速度~20 tok/s
速率限制2,000 RPM

多模态能力

图像输入图像输出音频输入音频输出

基准测试

AIME 2024
79.8%
MATH
97.3%
GPQA
71.5%

并排比较

模型层级输入输出上下文
DeepSeek V4 ProReasoning$0.435$0.8701M
DeepSeek V4 FlashFlagship$0.140$0.2801M
DeepSeek V3Flagship$0.270$1.1066K
DeepSeek R1Reasoning$0.550$2.1966K