Llama-3.1-Nemotron-70B-Instruct-HF

completions

bytogetherai

Llama-3.1-Nemotron-70B-Instruct-HF

completions

Published by: nvidiaProvider:

togetherai

Llama-3.1-Nemotron-70B-Instruct is a large language model customized by NVIDIA in order to improve the helpfulness of LLM generated responses.

Released

Oct 1, 2024

Knowledge

Dec 1, 2023

License

Meta Llama Community

Context

32768

Input

$0.9 / 1M tokens

Output

$0.9 / 1M tokens

Accepts: text

Returns: text

Released Oct 1, 2024Knowledge Cutoff: Dec 1, 2023License: Meta Llama Community

Context: 32768 Input: $0.9 / 1M tokensOutput: $0.9 / 1M tokensAccepts: textReturns: text

Access Llama-3.1-Nemotron-70B-Instruct-HF through LangDB AI Gateway

Recommended

Integrate with nvidia's Llama-3.1-Nemotron-70B-Instruct-HF and 250+ other models through a unified API. Monitor usage, control costs, and enhance security.

Unified API

Cost Optimization

Enterprise Security

Get Started Now

Free tier available • No credit card required

Instant Setup

99.9% Uptime

10,000+Monthly Requests

Benchmark Results for Llama-3.1-Nemotron-70B-Instruct-HF

Category Performance Scores:

Maths: Score 11.00 (Top 86% - Rank #300)
Finance: Score 12.20 (Top 85% - Rank #296)
Science: Score 50.59 (Top 78% - Rank #272)
Writing: Score 34.11 (Top 89% - Rank #310)
Academia: Score 29.95 (Top 79% - Rank #275)
Marketing: Score 32.23 (Top 89% - Rank #310)
Programming: Score 10.80 (Top 84% - Rank #293)

Overall Performance: 25.839920634920638 average score across all categories

Detailed Benchmark Scores:

Benchmark	Score	Percentile	Domain
HLE	4.60	Top 65%	General Knowledge
AIME	24.70	Top 58%	Mathematics
GPQA	46.50	Top 79%	STEM (Physics, Chemistry, Biology)
SciCode	23.30	Top 76%	Scientific
MATH-500	73.30	Top 70%	Mathematics
MMLU-Pro	69.00	Top 73%	General Knowledge
LiveCodeBench	16.90	Top 84%	Programming
AA Math Index	11.00	Top 86%	Mathematics
AA Coding Index	10.80	Top 84%	Programming
AAII	13.40	Top 77%	General

GPQA Score: 46.50 - Graduate-level reasoning benchmark

Model Comparison:

Provider: togetherai

Model Type: completions

Context Size: 32768 tokens

Comparing against 348 models in the database

Category Scores

Benchmark Tests

View Other Benchmarks

HLE

4.6

General Knowledge

AIME

24.7

Mathematics

GPQA

46.5

STEM (Physics, Chemistry, Biology)

SciCode

23.3

Scientific

MATH-500

73.3

Mathematics

MMLU-Pro

69.0

General Knowledge

LiveCodeBench

16.9

Programming

AA Math Index

11.0

Mathematics

AA Coding Index

10.8

Programming

AAII

13.4

General

Metric	HLE	AIME	GPQA	SciCode	MATH-500	MMLU-Pro	LiveCodeBench	AA Math Index	AA Coding Index	AAII
Score	4.6	24.7	46.5	23.3	73.3	69.0	16.9	11.0	10.8	13.4

Compare with Similar Models

claude-opus-4.5

claude-opus-4.6

gemini-3-flash-preview

claude-sonnet-4.5

gemini-3-pro-preview

claude-sonnet-4

Code Examples

Integration samples and API usage

Code Samples for Llama-3.1-Nemotron-70B-Instruct-HF

Python SDK Example:

from openai import OpenAI

client = OpenAI(
    base_url="https://api.langdb.ai/projects/<your_project_id>",
    api_key="<your_api_key>"
)

response = client.chat.completions.create(
    model="Llama-3.1-Nemotron-70B-Instruct-HF",
    messages=[
        {"role": "user", "content": "Hello, how are you?"}
    ]
)

print(response.choices[0].message.content)

TypeScript SDK Example:

import OpenAI from 'openai';

const client = new OpenAI({
    baseURL: "https://api.langdb.ai/projects/<your_project_id>",
    apiKey: "<your_api_key>"
});

const response = await client.chat.completions.create({
    model: "Llama-3.1-Nemotron-70B-Instruct-HF",
    messages: [
        { role: "user", content: "Hello, how are you?" }
    ]
});

console.log(response.choices[0].message.content);

cURL Example:

curl -X POST "https://api.langdb.ai/projects/<your_project_id>/v1/chat/completions" \
  -H "Authorization: Bearer <your_api_key>" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "Llama-3.1-Nemotron-70B-Instruct-HF",
    "messages": [
        {"role": "user", "content": "Hello, how are you?"}
    ]
  }'

Model: Llama-3.1-Nemotron-70B-Instruct-HF

Provider: togetherai

API Endpoint: $https://api.langdb.ai

Create API Key

Related Models

Similar models from togetherai

Llama-3.1-Nemotron-70B-Instruct-HF

Llama-3.1-Nemotron-70B-Instruct-HF

Access Llama-3.1-Nemotron-70B-Instruct-HF through LangDB AI Gateway

Category Scores

Benchmark Tests

Compare with Similar Models

Code Examples

Related Models

DeepSeek-R1-Distill-Qwen-14B

DeepSeek-R1-Distill-Qwen-1.5B

gemma-2-27b-it

Nous-Hermes-2-Mixtral-8x7B-DPO

Qwen2.5-72B-Instruct-Turbo

Qwen2.5-7B-Instruct-Turbo

Llama-3.1-Nemotron-70B-Instruct-HF by togetherai - AI Model Details, Pricing, and Performance Metrics

Llama-3.1-Nemotron-70B-Instruct-HF

Llama-3.1-Nemotron-70B-Instruct-HF

Access Llama-3.1-Nemotron-70B-Instruct-HF through LangDB AI Gateway

Category Scores

Benchmark Tests

Compare with Similar Models

Code Examples

Related Models

DeepSeek-R1-Distill-Qwen-14B

DeepSeek-R1-Distill-Qwen-1.5B

gemma-2-27b-it

Nous-Hermes-2-Mixtral-8x7B-DPO

Qwen2.5-72B-Instruct-Turbo

Qwen2.5-7B-Instruct-Turbo