Meta Modèles

Explorez les 2 modèles de Meta avec des prix détaillés, avantages et inconvénients, et recommandations pour développeurs.

Modèles

$0.100

Entrée la moins chère

10M

Contexte max

Niveaux de qualité

Recommandations rapides

Meilleur rapport qualité-prix: Llama 4 Scout ($0.100/1M)

Meilleure qualité: Llama 4 Maverick

Llama 4 Maverick

Flagship

Open-source, multimodal

Tarifs officiels

Quand l'utiliser: For teams wanting open-source control or self-hosting with multimodal needs.

Points clés de la mise à niveau

◆Open-source — self-host for free, full model weight control
◆1M context window — first open-source model with this capacity
◆Multimodal (text + vision) + fine-tunable — unique combination
◆17B active params (109B total) — MoE architecture for efficiency
◆4K max output is limiting — use for input-heavy, short-output tasks

Prix d'entrée

$0.200

per 1M tokens

Prix de sortie

$0.600

per 1M tokens

Entrée en cache

—

per 1M tokens

Entrée batch

—

per 1M tokens

Fenêtre de contexte: 1M

Sortie max: 4,096 tokens

Date de coupure des connaissances: 2024-08

VisionAppel de fonctionAjustement finMode JSONNiveau gratuit

Avantages

Open-source — can self-host for free
1M context window
Multimodal + fine-tunable

Inconvénients

Only 4K max output
No JSON mode
Hosted pricing via third-party (Together AI)

Performance

Vitesse de sortie~80 tok/s

Limite de débit—

Multimodal

Entrée imageSortie imageEntrée audioSortie audio

Benchmarks

MMLU

84.5%

HumanEval

83.0%

SWE-bench Verified

44.2%

Llama 4 Scout

Mid-tier

Open-source, long context

Tarifs officiels

Quand l'utiliser: Unmatched for processing very long documents. Best for RAG with massive context windows.

Points clés de la mise à niveau

◆10M token context — 10x larger than any other model available
◆Open-source + fine-tunable — self-host for unlimited usage
◆$0.10/M input — cheapest per-token model in the market
◆17B active params (109B total) — same efficient MoE as Maverick
◆4K max output — designed for retrieval/analysis, not long generation

Prix d'entrée

$0.100

per 1M tokens

Prix de sortie

$0.300

per 1M tokens

Entrée en cache

—

per 1M tokens

Entrée batch

—

per 1M tokens

Fenêtre de contexte: 10M

Sortie max: 4,096 tokens

Date de coupure des connaissances: 2024-08

VisionAppel de fonctionAjustement finMode JSONNiveau gratuit

Avantages

10M token context — largest available
Cheapest per-token model
Open-source + fine-tunable

Inconvénients

Only 4K max output
No JSON mode
Quality below proprietary flagships

Performance

Vitesse de sortie~90 tok/s

Limite de débit—

Multimodal

Entrée imageSortie imageEntrée audioSortie audio

Benchmarks

MMLU

81.0%

HumanEval

78.5%

Comparaison côte à côte

Modèle	Niveau	Entrée	Sortie	En cache	Contexte	Sortie max
Llama 4 Maverick	Flagship	$0.200	$0.600	—	1M	4,096
Llama 4 Scout	Mid-tier	$0.100	$0.300	—	10M	4,096