xAI Modelos

Explore todos os 3 modelos de xAI com preços detalhados, prós e contras e recomendações para desenvolvedores.

Modelos

$0.200

Menor Entrada

Contexto Máximo

Níveis de Qualidade

Recomendações Rápidas

Melhor Custo-Benefício: Grok 4.1 Fast ($0.200/1M)

Melhor Qualidade: Grok 4.3

Grok 4.3

Flagship

General-purpose, max capability

Preços Oficiais

Quando usar: Best xAI model for general-purpose applications requiring real-time knowledge access.

Destaques da Atualização

◆1M context window — 8x increase over Grok 3's 128K
◆Price: $1.25/$2.50 — 60% cheaper than Grok 3 ($3/$15)
◆Real-time web + X (Twitter) search — unique live knowledge access
◆Context caching: $0.20/M — 84% savings for repeated prefixes
◆32K max output — 4x Grok 3's 8K for long-form generation

Preço de Entrada

$1.25

per 1M tokens

Preço de Saída

$2.50

per 1M tokens

Entrada em Cache

$0.200

per 1M tokens

Entrada em Lote

—

per 1M tokens

Janela de Contexto: 1M

Saída Máxima: 32,000 tokens

Corte de Conhecimento: 2025-06

VisãoChamada de FunçãoAjuste FinoModo JSON

Prós

1M context window at $1.25/M input
Most intelligent xAI model
Real-time web and X search

Contras

No batch API
No fine-tuning
Newer ecosystem

Desempenho

Velocidade de saída~65 tok/s

Limite de taxa3,000 RPM

Multimodal

Entrada de imagemSaída de imagemEntrada de áudioSaída de áudio

Benchmarks

MMLU

87.5%

SWE-bench Verified

62.0%

GPQA

73.0%

Grok 4.1 Fast

Mid-tier

Cost-optimized production workloads

Preços Oficiais

Quando usar: Best for latency-sensitive production apps and long-document processing on a budget.

Destaques da Atualização

◆2M context window — largest among all mid-tier models
◆$0.20/M input — 6x cheaper than Grok 4.3 for high-volume tasks
◆Ultra-low latency — optimized for sub-500ms response times
◆16K max output — 2x Grok 3's 8K for longer generations
◆Context caching: $0.05/M — 75% savings for repeated prefixes

Preço de Entrada

$0.200

per 1M tokens

Preço de Saída

$0.500

per 1M tokens

Entrada em Cache

$0.050

per 1M tokens

Entrada em Lote

—

per 1M tokens

Janela de Contexto: 2M

Saída Máxima: 16,000 tokens

Corte de Conhecimento: 2025-04

VisãoChamada de FunçãoAjuste FinoModo JSON

Prós

2M context window — largest among mid-tier
Extremely fast response times
One of lowest rates for frontier APIs

Contras

Lower quality than Grok 4.3
No batch API
No fine-tuning

Desempenho

Velocidade de saída~110 tok/s

Limite de taxa8,000 RPM

Multimodal

Entrada de imagemSaída de imagemEntrada de áudioSaída de áudio

Benchmarks

MMLU

83.0%

HumanEval

80.0%

Grok 3

Flagship

Real-time info, analysis

Preços Oficiais

Quando usar: When real-time X/Twitter data or very current knowledge is essential.

Destaques da Atualização

◆Real-time X/Twitter integration — unique live social media knowledge
◆Knowledge cutoff: 2025-02 — one of the most current available
◆Vision + function calling — full multimodal capability at launch
◆No caching or batch API — upgrade to Grok 4.3 for caching support
◆128K context — superseded by Grok 4.3's 1M for long-document tasks

Preço de Entrada

$3.00

per 1M tokens

Preço de Saída

$15.00

per 1M tokens

Entrada em Cache

—

per 1M tokens

Entrada em Lote

—

per 1M tokens

Janela de Contexto: 131K

Saída Máxima: 8,192 tokens

Corte de Conhecimento: 2025-02

VisãoChamada de FunçãoAjuste FinoModo JSON

Prós

Very recent knowledge cutoff
Real-time X/Twitter integration
Vision + function calling

Contras

Expensive ($3/$15)
No caching or batch
No fine-tuning

Desempenho

Velocidade de saída~55 tok/s

Limite de taxa2,000 RPM

Multimodal

Entrada de imagemSaída de imagemEntrada de áudioSaída de áudio

Benchmarks

MMLU

85.0%

GPQA

68.0%

MATH

70.0%

Comparação Lado a Lado

Modelo	Nível	Entrada	Saída	Em Cache	Contexto	Saída Máxima
Grok 4.3	Flagship	$1.25	$2.50	$0.200	1M	32,000
Grok 4.1 Fast	Mid-tier	$0.200	$0.500	$0.050	2M	16,000
Grok 3	Flagship	$3.00	$15.00	—	131K	8,192