llama-3.2-3b-instruct by deepinfra - AI Model Details, Pricing, and Performance Metrics

meta-llama
llama-3.2-3b-instruct
meta-llama

llama-3.2-3b-instruct

completions
bydeepinfra

Llama 3.2 3B is a 3-billion-parameter multilingual large language model, optimized for advanced natural language processing tasks like dialogue generation, reasoning, and summarization. Designed with the latest transformer architecture, it supports eight languages, including English, Spanish, and Hindi, and is adaptable for additional languages. Trained on 9 trillion tokens, the Llama 3.2 3B model excels in instruction-following, complex reasoning, and tool use. Its balanced performance makes it ideal for applications needing accuracy and efficiency in text generation across multilingual settings. Click here for the [original model card](https://github.com/meta-llama/llama-models/blob/main/models/llama3_2/MODEL_CARD.md). Usage of this model is subject to [Meta's Acceptable Use Policy](https://www.llama.com/llama3/use-policy/).

Released
Sep 25, 2024
Knowledge
Mar 29, 2024
License
llama_3_2_community_license
Context
131072
Input
$0.01 / 1M tokens
Output
$0.02 / 1M tokens
Accepts: text
Returns: text

Access llama-3.2-3b-instruct through LangDB AI Gateway

Recommended

Integrate with meta-llama's llama-3.2-3b-instruct and 250+ other models through a unified API. Monitor usage, control costs, and enhance security.

Unified API
Cost Optimization
Enterprise Security
Get Started Now

Free tier available • No credit card required

Instant Setup
99.9% Uptime
10,000+Monthly Requests
Request Volume
Daily API requests
1
Performance (TPS)
Tokens per second
999.74 tokens/s

Category Scores

Benchmark Tests

View Other Benchmarks
AIME
6.7
Mathematics
AA Coding Index
6.7
Programming
AAII
11.2
General
AA Math Index
27.8
Mathematics
GPQA
32.8
STEM (Physics, Chemistry, Biology)
HLE
5.2
General Knowledge
LiveCodeBench
8.3
Programming
MATH-500
48.9
Mathematics
MMLU
63.4
General Knowledge
MMLU-Pro
34.7
General Knowledge
SciCode
5.2
Scientific

Code Examples

Integration samples and API usage