llama-3.1-nemotron-70b-instruct

completions

bydeepinfra

llama-3.1-nemotron-70b-instruct

completions

Published by: nvidiaProvider:

deepinfra

NVIDIA's Llama 3.1 Nemotron 70B is a language model designed for generating precise and useful responses. Leveraging [Llama 3.1 70B](/models/meta-llama/llama-3.1-70b-instruct) architecture and Reinforcement Learning from Human Feedback (RLHF), it excels in automatic alignment benchmarks. This model is tailored for applications requiring high accuracy in helpfulness and response generation, suitable for diverse user queries across multiple domains. Usage of this model is subject to [Meta's Acceptable Use Policy](https://www.llama.com/llama3/use-policy/).

Released

Oct 1, 2024

Knowledge

Dec 1, 2023

License

llama_3_1_community_license

Context

131072

Input

$0.12 / 1M tokens

Output

$0.3 / 1M tokens

Capabilities: tools

Accepts: text

Returns: text

Released Oct 1, 2024Knowledge Cutoff: Dec 1, 2023License: llama_3_1_community_license

Context: 131072 Input: $0.12 / 1M tokensOutput: $0.3 / 1M tokensCapabilities: toolsAccepts: textReturns: text

Access llama-3.1-nemotron-70b-instruct through LangDB AI Gateway

Recommended

Integrate with nvidia's llama-3.1-nemotron-70b-instruct and 250+ other models through a unified API. Monitor usage, control costs, and enhance security.

Unified API

Cost Optimization

Enterprise Security

Get Started Now

Free tier available • No credit card required

Instant Setup

99.9% Uptime

10,000+Monthly Requests

Benchmark Results for llama-3.1-nemotron-70b-instruct

Category Performance Scores:

Maths: Score 11.00 (Top 86% - Rank #300)
Finance: Score 12.20 (Top 85% - Rank #296)
Science: Score 50.59 (Top 78% - Rank #272)
Writing: Score 34.11 (Top 89% - Rank #310)
Academia: Score 29.95 (Top 79% - Rank #275)
Marketing: Score 32.23 (Top 89% - Rank #310)
Programming: Score 10.80 (Top 84% - Rank #293)

Overall Performance: 25.839920634920638 average score across all categories

Detailed Benchmark Scores:

Benchmark	Score	Percentile	Domain
HLE	4.60	Top 65%	General Knowledge
AIME	24.70	Top 58%	Mathematics
GPQA	46.50	Top 79%	STEM (Physics, Chemistry, Biology)
SciCode	23.30	Top 76%	Scientific
MATH-500	73.30	Top 70%	Mathematics
MMLU-Pro	69.00	Top 73%	General Knowledge
LiveCodeBench	16.90	Top 84%	Programming
AA Math Index	11.00	Top 86%	Mathematics
AA Coding Index	10.80	Top 84%	Programming
AAII	13.40	Top 77%	General

GPQA Score: 46.50 - Graduate-level reasoning benchmark

Model Comparison:

Provider: deepinfra

Model Type: completions

Context Size: 131072 tokens

Comparing against 348 models in the database

Category Scores

Benchmark Tests

View Other Benchmarks

HLE

4.6

General Knowledge

AIME

24.7

Mathematics

GPQA

46.5

STEM (Physics, Chemistry, Biology)

SciCode

23.3

Scientific

MATH-500

73.3

Mathematics

MMLU-Pro

69.0

General Knowledge

LiveCodeBench

16.9

Programming

AA Math Index

11.0

Mathematics

AA Coding Index

10.8

Programming

AAII

13.4

General

Metric	HLE	AIME	GPQA	SciCode	MATH-500	MMLU-Pro	LiveCodeBench	AA Math Index	AA Coding Index	AAII
Score	4.6	24.7	46.5	23.3	73.3	69.0	16.9	11.0	10.8	13.4

Compare with Similar Models

claude-opus-4.5

claude-opus-4.6

gemini-3-flash-preview

claude-sonnet-4.5

gemini-3-pro-preview

claude-sonnet-4

Code Examples

Integration samples and API usage

Code Samples for llama-3.1-nemotron-70b-instruct

Python SDK Example:

from openai import OpenAI

client = OpenAI(
    base_url="https://api.us-east-1.langdb.ai/projects/<your_project_id>",
    api_key="<your_api_key>"
)

response = client.chat.completions.create(
    model="llama-3.1-nemotron-70b-instruct",
    messages=[
        {"role": "user", "content": "Hello, how are you?"}
    ]
)

print(response.choices[0].message.content)

TypeScript SDK Example:

import OpenAI from 'openai';

const client = new OpenAI({
    baseURL: "https://api.us-east-1.langdb.ai/projects/<your_project_id>",
    apiKey: "<your_api_key>"
});

const response = await client.chat.completions.create({
    model: "llama-3.1-nemotron-70b-instruct",
    messages: [
        { role: "user", content: "Hello, how are you?" }
    ]
});

console.log(response.choices[0].message.content);

cURL Example:

curl -X POST "https://api.us-east-1.langdb.ai/projects/<your_project_id>/v1/chat/completions" \
  -H "Authorization: Bearer <your_api_key>" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "llama-3.1-nemotron-70b-instruct",
    "messages": [
        {"role": "user", "content": "Hello, how are you?"}
    ]
  }'

Model: llama-3.1-nemotron-70b-instruct

Provider: deepinfra

API Endpoint: $https://api.us-east-1.langdb.ai

Create API Key

Related Models

Similar models from deepinfra

llama-3.1-nemotron-70b-instruct

llama-3.1-nemotron-70b-instruct

Access llama-3.1-nemotron-70b-instruct through LangDB AI Gateway

Category Scores

Benchmark Tests

Compare with Similar Models

Code Examples

Related Models

deepseek-chat-v3-0324

deepseek-chat-v3.1

deepseek-prover-v2

DeepSeek-R1

deepseek-r1-0528

DeepSeek-R1-Distill-Llama-70B

llama-3.1-nemotron-70b-instruct by deepinfra - AI Model Details, Pricing, and Performance Metrics

llama-3.1-nemotron-70b-instruct

llama-3.1-nemotron-70b-instruct

Access llama-3.1-nemotron-70b-instruct through LangDB AI Gateway

Category Scores

Benchmark Tests

Compare with Similar Models

Code Examples

Related Models

deepseek-chat-v3-0324

deepseek-chat-v3.1

deepseek-prover-v2

DeepSeek-R1

deepseek-r1-0528

DeepSeek-R1-Distill-Llama-70B