llama-4-scout by deepinfra - AI Model Details, Pricing, and Performance Metrics

meta / llama-4-scout
completions • by deepinfra

Llama 4 Scout 17B Instruct (16E) is a mixture-of-experts (MoE) language model developed by Meta, activating 17 billion parameters out of a 109B total. It accepts native multimodal input (text and image) and produces multilingual text and code output across 12 supported languages. Designed for assistant-style interaction and visual reasoning, Scout uses 16 experts in its MoE layers (the "16E" in its name) and was trained on a corpus of roughly 40 trillion tokens; Meta quotes a native context length of 10 million tokens, while the deployment listed here exposes a 327,680-token context window. Built for high efficiency and local or commercial deployment, Llama 4 Scout uses early fusion for seamless modality integration and is instruction-tuned for multilingual chat, captioning, and image-understanding tasks. It is released under the Llama 4 Community License, was trained on data up to August 2024, and launched publicly on April 5, 2025.
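To make the "17B active out of 109B total" figure concrete, here is a minimal, hypothetical PyTorch sketch of a top-1 MoE feed-forward layer with 16 routed experts plus an always-on shared expert. It illustrates the general technique only; the sizes, routing scheme, and layer structure are assumptions, not Meta's implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class ToyMoELayer(nn.Module):
    """Toy top-1 mixture-of-experts feed-forward layer: 16 routed experts
    plus a shared expert. A simplified illustration of an MoE block,
    not Meta's actual Llama 4 architecture."""

    def __init__(self, d_model: int = 128, d_ff: int = 512, n_experts: int = 16):
        super().__init__()
        self.router = nn.Linear(d_model, n_experts)
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_ff), nn.SiLU(), nn.Linear(d_ff, d_model))
            for _ in range(n_experts)
        )
        self.shared_expert = nn.Sequential(
            nn.Linear(d_model, d_ff), nn.SiLU(), nn.Linear(d_ff, d_model)
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:  # x: (n_tokens, d_model)
        weights = F.softmax(self.router(x), dim=-1)       # routing probabilities
        top1 = weights.argmax(dim=-1)                      # one routed expert per token
        out = self.shared_expert(x)                        # shared expert always runs
        for i, expert in enumerate(self.experts):
            mask = top1 == i                               # tokens routed to expert i
            if mask.any():
                out[mask] = out[mask] + weights[mask, i].unsqueeze(-1) * expert(x[mask])
        return out

# Only one of the 16 routed experts runs per token, which is why the number of
# active parameters per token is far smaller than the total parameter count.
layer = ToyMoELayer()
print(layer(torch.randn(4, 128)).shape)  # torch.Size([4, 128])
```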

Released: Apr 5, 2025
Knowledge cutoff: Oct 7, 2024
License: Llama 4 Community License Agreement
Context: 327,680 tokens
Input: $0.08 / 1M tokens
Output: $0.30 / 1M tokens
Accepts: text, image
Returns: text
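As a quick illustration of the pricing above, the sketch below estimates the cost of a single request at the listed rates; the token counts in the example are hypothetical.

```python
# Prices from the listing above, in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.08
OUTPUT_PRICE_PER_M = 0.30

def request_cost_usd(prompt_tokens: int, completion_tokens: int) -> float:
    """Estimate the USD cost of one llama-4-scout request at the listed rates."""
    return (prompt_tokens * INPUT_PRICE_PER_M
            + completion_tokens * OUTPUT_PRICE_PER_M) / 1_000_000

# Hypothetical example: a 2,000-token prompt that returns a 500-token reply.
print(f"${request_cost_usd(2_000, 500):.6f}")  # -> $0.000310
```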

Access llama-4-scout through LangDB AI Gateway (Recommended)

Integrate with Meta's llama-4-scout and 250+ other models through a unified API. Monitor usage, control costs, and enhance security.

Unified API • Cost Optimization • Enterprise Security
Get Started Now: free tier available, no credit card required
Instant Setup • 99.9% Uptime • 10,000+ Monthly Requests

Benchmark Tests

AIME: 28.3 (Mathematics)
AA Coding Index: 23.5 (Programming)
AAII: 28.1 (General)
AA Math Index: 14.0 (Mathematics)
GPQA: 57.9 (STEM: Physics, Chemistry, Biology)
HLE: 4.3 (General Knowledge)
LiveCodeBench: 29.9 (Programming)
MATH-500: 84.4 (Mathematics)
MMLU: 79.6 (General Knowledge)
MMLU-Pro: 74.8 (General Knowledge)
MMMU: 69.4 (General Knowledge)
SciCode: 17.0 (Scientific)

Code Examples

Integration samples and API usage
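Below is a minimal sketch of calling the model through an OpenAI-compatible chat completions endpoint. The base URL, environment variable, and model id are assumptions; substitute the endpoint and model name exposed by your provider or gateway (for example LangDB AI Gateway or deepinfra).

```python
# Minimal sketch using the OpenAI Python SDK against an OpenAI-compatible endpoint.
# The base_url, API key variable, and model id below are assumptions; replace them
# with the values your provider or gateway documents.
import os
from openai import OpenAI

client = OpenAI(
    base_url="https://api.deepinfra.com/v1/openai",  # assumed OpenAI-compatible endpoint
    api_key=os.environ["DEEPINFRA_API_KEY"],          # assumed environment variable
)

# Text-only chat completion.
resp = client.chat.completions.create(
    model="meta-llama/Llama-4-Scout-17B-16E-Instruct",  # assumed model id
    messages=[{"role": "user", "content": "Summarize mixture-of-experts in two sentences."}],
    max_tokens=256,
)
print(resp.choices[0].message.content)

# The model accepts images as input; OpenAI-style clients commonly pass them as an
# image_url content part alongside text. Whether this is supported depends on the endpoint.
vision = client.chat.completions.create(
    model="meta-llama/Llama-4-Scout-17B-16E-Instruct",
    messages=[{
        "role": "user",
        "content": [
            {"type": "text", "text": "What is shown in this image?"},
            {"type": "image_url", "image_url": {"url": "https://example.com/photo.jpg"}},
        ],
    }],
    max_tokens=256,
)
print(vision.choices[0].message.content)
```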