llama-4-scout by deepinfra - AI Model Details, Pricing, and Performance Metrics
llama-4-scout
completionsLlama 4 Scout 17B Instruct (16E) is a mixture-of-experts (MoE) language model developed by Meta, activating 17 billion parameters out of a total of 109B. It supports native multimodal input (text and image) and multilingual output (text and code) across 12 supported languages. Designed for assistant-style interaction and visual reasoning, Scout uses 16 experts per forward pass and features a context length of 10 million tokens, with a training corpus of ~40 trillion tokens. Built for high efficiency and local or commercial deployment, Llama 4 Scout incorporates early fusion for seamless modality integration. It is instruction-tuned for use in multilingual chat, captioning, and image understanding tasks. Released under the Llama 4 Community License, it was last trained on data up to August 2024 and launched publicly on April 5, 2025.
Llama 4 Scout 17B Instruct (16E) is a mixture-of-experts (MoE) language model developed by Meta, activating 17 billion parameters out of a total of 109B. It supports native multimodal input (text and image) and multilingual output (text and code) across 12 supported languages. Designed for assistant-style interaction and visual reasoning, Scout uses 16 experts per forward pass and features a context length of 10 million tokens, with a training corpus of ~40 trillion tokens. Built for high efficiency and local or commercial deployment, Llama 4 Scout incorporates early fusion for seamless modality integration. It is instruction-tuned for use in multilingual chat, captioning, and image understanding tasks. Released under the Llama 4 Community License, it was last trained on data up to August 2024 and launched publicly on April 5, 2025.
| Provider | Input | Output | 
|---|---|---|
| $0.08 / 1M tokens | $0.3 / 1M tokens | |
| $0.11 / 1M tokens | $0.34 / 1M tokens | 
| Providers | Context | Input Price | Output Price | Input Formats | Output Formats | License | 
|---|---|---|---|---|---|---|
| 327680 tokens | $0.08 / 1M tokens | $0.3 / 1M tokens | textimage  | text  | llama_4_community_license_agreement  | |
| 131072 tokens | $0.11 / 1M tokens | $0.34 / 1M tokens | textimage  | text  | Meta Llama Community  | 
Access llama-4-scout through LangDB AI Gateway
Integrate with meta's llama-4-scout and 250+ other models through a unified API. Monitor usage, control costs, and enhance security.
Free tier available • No credit card required
Category Scores
Benchmark Tests
| Metric | AIME  | AA Coding Index  | AAII  | AA Math Index  | GPQA  | HLE  | LiveCodeBench  | MATH-500  | MMLU  | MMLU-Pro  | MMMU  | SciCode  | 
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Score | 28.3  | 16.1  | 28.1  | 14.0  | 57.9  | 4.3  | 29.9  | 84.4  | 79.6  | 74.8  | 69.4  | 17.0  | 
Compare with Similar Models
Code Examples
Integration samples and API usage
Related Models
Similar models from deepinfra