llama-3.2-3b-instruct

completions

by deepinfra

Published by: meta

Provider: deepinfra

Llama 3.2 3B is a 3-billion-parameter multilingual large language model optimized for natural language processing tasks such as dialogue generation, reasoning, and summarization. Built on a transformer architecture, it officially supports eight languages, including English, Spanish, and Hindi, and can be adapted to additional languages. Trained on up to 9 trillion tokens, it is tuned for instruction following, reasoning, and tool use. Its balance of accuracy and efficiency makes it suitable for multilingual text generation. See the [original model card](https://github.com/meta-llama/llama-models/blob/main/models/llama3_2/MODEL_CARD.md) for details. Use of this model is subject to [Meta's Acceptable Use Policy](https://www.llama.com/llama3/use-policy/).

Released: Sep 25, 2024

Knowledge cutoff: Mar 29, 2024

License: llama_3_2_community_license

Context: 131072 tokens

Input: $0.01 / 1M tokens

Output: $0.02 / 1M tokens

Accepts: text

Returns: text
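To make the per-token prices concrete, here is a minimal sketch of a cost estimate. It assumes the listed rates ($0.01 per 1M input tokens, $0.02 per 1M output tokens) scale linearly, with no minimum charge or rounding; `estimate_cost_usd` and the token counts are hypothetical and used only for illustration.

```python
from decimal import Decimal

# Listed prices in USD per 1M tokens (taken from the pricing above).
INPUT_PRICE_PER_M = Decimal("0.01")
OUTPUT_PRICE_PER_M = Decimal("0.02")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Hypothetical helper: linear cost estimate, ignoring any minimum charges."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (Decimal(input_tokens) * INPUT_PRICE_PER_M
             + Decimal(output_tokens) * OUTPUT_PRICE_PER_M)
    return total / TOKENS_PER_M


# Example workload: 2M input tokens and 500k output tokens.
print(estimate_cost_usd(2_000_000, 500_000))
```

Decimal arithmetic is used instead of floats so that very small per-request amounts are not distorted by binary rounding.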

Access llama-3.2-3b-instruct through LangDB AI Gateway

Integrate with Meta's llama-3.2-3b-instruct and 250+ other models through a unified API. Monitor usage, control costs, and enhance security.

Unified API

Cost Optimization

Enterprise Security

Free tier available • No credit card required

Instant Setup

99.9% Uptime

10,000+ Monthly Requests

Benchmark Results for llama-3.2-3b-instruct

Category Performance Scores:

Maths: Score 3.30 (Top 97% - Rank #338)
Finance: Score 6.50 (Top 98% - Rank #342)
Science: Score 30.47 (Top 96% - Rank #335)
Writing: Score 21.16 (Top 98% - Rank #342)
Academia: Score 21.25 (Top 94% - Rank #328)
Marketing: Score 19.74 (Top 97% - Rank #338)

Overall Performance: 17.07 average score across all categories

Detailed Benchmark Scores:

Benchmark	Score	Percentile	Domain
HLE	5.20	Top 55%	General Knowledge
AIME	6.70	Top 81%	Mathematics
GPQA	32.80	Top 94%	STEM (Physics, Chemistry, Biology)
SciCode	5.20	Top 97%	Scientific
MATH-500	48.90	Top 93%	Mathematics
MMLU-Pro	34.70	Top 97%	General Knowledge
LiveCodeBench	8.30	Top 96%	Programming
AA Math Index	3.30	Top 97%	Mathematics
AAII	9.70	Top 94%	General

GPQA Score: 32.80 - Graduate-level reasoning benchmark

Model Comparison:

Provider: deepinfra

Model Type: completions

Context Size: 131072 tokens

Comparing against 348 models in the database
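The "Top N%" labels in the category scores appear consistent with dividing a model's rank by the 348 models in the database. This is an inference from the listed numbers, not a documented formula; the sketch below checks it with a hypothetical `top_percent` helper.

```python
TOTAL_MODELS = 348  # "Comparing against 348 models in the database"

# (rank, listed "Top N%") per category, copied from the category scores.
CATEGORY_RANKS = {
    "Maths": (338, 97),
    "Finance": (342, 98),
    "Science": (335, 96),
    "Writing": (342, 98),
    "Academia": (328, 94),
    "Marketing": (338, 97),
}


def top_percent(rank: int, total: int = TOTAL_MODELS) -> int:
    """Assumed formula: rank expressed as a rounded percentage of all ranked models."""
    return round(100 * rank / total)


for category, (rank, listed) in CATEGORY_RANKS.items():
    computed = top_percent(rank)
    print(f"{category}: rank #{rank} -> Top {computed}% (listed: Top {listed}%)")
```

If this reading is right, a smaller percentage indicates a better rank, so "Top 97%" places the model near the bottom of the compared set for that category.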

Compare with Similar Models

claude-opus-4.5

claude-opus-4.6

gemini-3-flash-preview

claude-sonnet-4.5

gemini-3-pro-preview

claude-sonnet-4

Code Examples

Integration samples and API usage

Code Samples for llama-3.2-3b-instruct

Python SDK Example:

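from openai import OpenAI

# Point the OpenAI SDK at the LangDB gateway instead of api.openai.com.
# The SDK appends "/chat/completions" to base_url; the cURL example targets
# ".../v1/chat/completions", so append "/v1" here if requests are not routed.
client = OpenAI(
    base_url="https://api.langdb.ai/projects/<your_project_id>",
    api_key="<your_api_key>"
)

# Send a single-turn chat request to the model.
response = client.chat.completions.create(
    model="llama-3.2-3b-instruct",
    messages=[
        {"role": "user", "content": "Hello, how are you?"}
    ]
)

# Print the text of the first returned choice.
print(response.choices[0].message.content)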

TypeScript SDK Example:

import OpenAI from 'openai';

const client = new OpenAI({
    baseURL: "https://api.langdb.ai/projects/<your_project_id>",
    apiKey: "<your_api_key>"
});

const response = await client.chat.completions.create({
    model: "llama-3.2-3b-instruct",
    messages: [
        { role: "user", content: "Hello, how are you?" }
    ]
});

console.log(response.choices[0].message.content);

cURL Example:

curl -X POST "https://api.langdb.ai/projects/<your_project_id>/v1/chat/completions" \
  -H "Authorization: Bearer <your_api_key>" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "llama-3.2-3b-instruct",
    "messages": [
        {"role": "user", "content": "Hello, how are you?"}
    ]
  }'
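Where the OpenAI SDK is not available, the cURL request above can be reproduced with Python's standard library. This is a minimal sketch: `build_chat_request` is a hypothetical helper, the project ID and API key remain placeholders, and the response shape in the commented-out part assumes the gateway returns an OpenAI-compatible body, as the SDK examples imply.

```python
import json
import urllib.request

# Placeholder base URL, as in the samples above; substitute a real project ID.
BASE_URL = "https://api.langdb.ai/projects/<your_project_id>"


def build_chat_request(prompt: str, api_key: str,
                       base_url: str = BASE_URL) -> urllib.request.Request:
    """Hypothetical helper: builds the same POST request the cURL example sends."""
    payload = {
        "model": "llama-3.2-3b-instruct",
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        url=f"{base_url}/v1/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


request = build_chat_request("Hello, how are you?", api_key="<your_api_key>")
print(request.get_method(), request.full_url)

# Sending needs real credentials, so the network call is left commented out:
# with urllib.request.urlopen(request) as resp:
#     body = json.load(resp)
#     print(body["choices"][0]["message"]["content"])
```

Building the request separately from sending it makes the URL, headers, and payload easy to inspect before any tokens are billed.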

Model: llama-3.2-3b-instruct

Provider: deepinfra

API Endpoint: https://api.langdb.ai

Related Models

Similar models from deepinfra

deepseek-chat-v3-0324

deepseek-chat-v3.1

deepseek-prover-v2

DeepSeek-R1

deepseek-r1-0528

DeepSeek-R1-Distill-Llama-70B
