llama-3.2-1b-instruct by deepinfra - AI Model Details, Pricing, and Performance Metrics

meta
llama-3.2-1b-instruct
meta

llama-3.2-1b-instruct

completions
bydeepinfra

Llama 3.2 1B is a 1-billion-parameter language model focused on efficiently performing natural language tasks, such as summarization, dialogue, and multilingual text analysis. Its smaller size allows it to operate efficiently in low-resource environments while maintaining strong task performance. Supporting eight core languages and fine-tunable for more, Llama 1.3B is ideal for businesses or developers seeking lightweight yet powerful AI solutions that can operate in diverse multilingual settings without the high computational demand of larger models. Click here for the [original model card](https://github.com/meta-llama/llama-models/blob/main/models/llama3_2/MODEL_CARD.md). Usage of this model is subject to [Meta's Acceptable Use Policy](https://www.llama.com/llama3/use-policy/).

Context
131072
Input
<$0.01 / 1M tokens
Output
$0.01 / 1M tokens
Accepts: text
Returns: text

Access llama-3.2-1b-instruct through LangDB AI Gateway

Recommended

Integrate with meta's llama-3.2-1b-instruct and 250+ other models through a unified API. Monitor usage, control costs, and enhance security.

Unified API
Cost Optimization
Enterprise Security
Get Started Now

Free tier available • No credit card required

Instant Setup
99.9% Uptime
10,000+Monthly Requests
Request Volume
Daily API requests
2
Performance (TPS)
Tokens per second
144.99 tokens/s

Code Examples

Integration samples and API usage