llama-4-scout by deepinfra - AI Model Details, Pricing, and Performance Metrics

meta
llama-4-scout
completions
by deepinfra

Llama 4 Scout 17B Instruct (16E) is a mixture-of-experts (MoE) language model developed by Meta, activating 17 billion parameters out of a total of 109B. It supports native multimodal input (text and image) and multilingual output (text and code) across 12 supported languages. Designed for assistant-style interaction and visual reasoning, Scout uses 16 experts per forward pass and features a context length of 10 million tokens, with a training corpus of ~40 trillion tokens. Built for high efficiency and local or commercial deployment, Llama 4 Scout incorporates early fusion for seamless modality integration. It is instruction-tuned for use in multilingual chat, captioning, and image understanding tasks. Released under the Llama 4 Community License, it was last trained on data up to August 2024 and launched publicly on April 5, 2025.
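The mixture-of-experts design described above (16 experts, with only a routed subset active per token) can be illustrated with a minimal sketch. The gating function, random logits, and top-1 routing below are simplifying assumptions for illustration only, not Meta's actual implementation:

```python
import math
import random

NUM_EXPERTS = 16  # Scout routes among 16 experts per MoE layer

def softmax(scores):
    """Numerically stable softmax over a list of router logits."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def route_token(router_logits, top_k=1):
    """Pick the top-k experts for one token from router logits.

    Returns (expert_indices, gate_weights). Only the chosen experts'
    parameters run for this token, which is why only ~17B of the
    model's 109B total parameters are active per forward pass.
    """
    probs = softmax(router_logits)
    ranked = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    chosen = ranked[:top_k]
    norm = sum(probs[i] for i in chosen)
    return chosen, [probs[i] / norm for i in chosen]

random.seed(0)
logits = [random.gauss(0, 1) for _ in range(NUM_EXPERTS)]
experts, weights = route_token(logits, top_k=1)
```

The per-token compute therefore scales with the active parameter count (17B), while memory must still hold all 109B parameters.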

Released: Apr 5, 2025
Knowledge cutoff: Oct 7, 2024
License: Llama 4 Community License Agreement
Context: 327,680 tokens
Input: $0.08 / 1M tokens
Output: $0.30 / 1M tokens
Accepts: text, image
Returns: text
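At the listed rates ($0.08 per 1M input tokens, $0.30 per 1M output tokens), the cost of a request can be estimated with simple arithmetic. This helper is an illustrative sketch, not an official billing calculator:

```python
INPUT_PRICE_PER_M = 0.08   # USD per 1M input tokens (listed rate)
OUTPUT_PRICE_PER_M = 0.30  # USD per 1M output tokens (listed rate)

def estimate_cost(input_tokens, output_tokens):
    """Estimate the USD cost of one request at the listed per-token rates."""
    return (input_tokens / 1_000_000) * INPUT_PRICE_PER_M \
         + (output_tokens / 1_000_000) * OUTPUT_PRICE_PER_M

# e.g. a 100k-token prompt with a 2k-token reply:
# 0.1 * $0.08 + 0.002 * $0.30 = $0.0086
cost = estimate_cost(100_000, 2_000)
```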

Access llama-4-scout through LangDB AI Gateway


Integrate with Meta's llama-4-scout and 250+ other models through a unified API. Monitor usage, control costs, and enhance security.

Unified API
Cost Optimization
Enterprise Security

Free tier available • No credit card required

Instant Setup
99.9% Uptime
10,000+ Monthly Requests

Benchmark Tests
HLE: 4.3 (General Knowledge)
AIME: 28.3 (Mathematics)
GPQA: 57.9 (STEM: Physics, Chemistry, Biology)
MMMU: 69.4 (General Knowledge)
SciCode: 17.0 (Scientific)
MATH-500: 84.4 (Mathematics)
MMLU-Pro: 75.2 (General Knowledge)
LiveCodeBench: 29.9 (Programming)
AA Math Index: 14.0 (Mathematics)
AA Coding Index: 6.7 (Programming)
AAII: 13.5 (General)

Code Examples

Integration samples and API usage
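No samples are reproduced on the page itself. As a sketch, DeepInfra exposes an OpenAI-compatible chat completions endpoint, so a request can be assembled as below. The base URL, model identifier, example image URL, and the DEEPINFRA_API_KEY variable are assumptions drawn from the provider's usual conventions; verify them against the official docs before use:

```python
import json
import os
import urllib.request

# Assumed OpenAI-compatible endpoint and model id (check provider docs).
BASE_URL = "https://api.deepinfra.com/v1/openai/chat/completions"
MODEL_ID = "meta-llama/Llama-4-Scout-17B-16E-Instruct"

def build_chat_payload(prompt, image_url=None, max_tokens=256):
    """Build an OpenAI-style chat payload; the model accepts text and image input."""
    content = [{"type": "text", "text": prompt}]
    if image_url is not None:
        content.append({"type": "image_url", "image_url": {"url": image_url}})
    return {
        "model": MODEL_ID,
        "messages": [{"role": "user", "content": content}],
        "max_tokens": max_tokens,
    }

def send(payload, api_key):
    """POST the payload and return the assistant's text reply."""
    req = urllib.request.Request(
        BASE_URL,
        data=json.dumps(payload).encode(),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
    )
    with urllib.request.urlopen(req) as resp:
        body = json.load(resp)
    return body["choices"][0]["message"]["content"]

# Multimodal request: one text part plus one image part (hypothetical URL).
payload = build_chat_payload("Describe this image.",
                             image_url="https://example.com/cat.jpg")
# reply = send(payload, os.environ["DEEPINFRA_API_KEY"])  # requires a valid key
```

The same payload shape works for text-only requests by omitting `image_url`.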