nemotron-nano-9b-v2

completions

bydeepinfra

nemotron-nano-9b-v2

completions

Published by: nvidiaProvider:

deepinfra

NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and tasks by first generating a reasoning trace and then concluding with a final response. The model's reasoning capabilities can be controlled via a system prompt. If the user prefers the model to provide its final answer without intermediate reasoning traces, it can be configured to do so.

Released

Aug 18, 2025

Knowledge

Feb 19, 2025

License

nvidia_open_model_license_agreement

Context

128K

Input

$0.04 / 1M tokens

Output

$0.16 / 1M tokens

Capabilities: tools

Accepts: text

Returns: text

Released Aug 18, 2025Knowledge Cutoff: Feb 19, 2025License: nvidia_open_model_license_agreement

Context: 128K Input: $0.04 / 1M tokensOutput: $0.16 / 1M tokensCapabilities: toolsAccepts: textReturns: text

Access nemotron-nano-9b-v2 through LangDB AI Gateway

Recommended

Integrate with nvidia's nemotron-nano-9b-v2 and 250+ other models through a unified API. Monitor usage, control costs, and enhance security.

Unified API

Cost Optimization

Enterprise Security

Get Started Now

Free tier available • No credit card required

Instant Setup

99.9% Uptime

10,000+Monthly Requests

Benchmark Results for nemotron-nano-9b-v2

Category Performance Scores:

Maths: Score 69.70 (Top 38% - Rank #133)
Finance: Score 42.25 (Top 45% - Rank #157)
Science: Score 57.87 (Top 69% - Rank #241)
Writing: Score 39.71 (Top 76% - Rank #265)
Academia: Score 35.90 (Top 69% - Rank #241)
Marketing: Score 37.46 (Top 74% - Rank #258)
Programming: Score 8.30 (Top 91% - Rank #317)

Overall Performance: 41.59809523809524 average score across all categories

Detailed Benchmark Scores:

Benchmark	Score	Percentile	Domain
HLE	4.60	Top 65%	General Knowledge
GPQA	57.00	Top 70%	STEM (Physics, Chemistry, Biology)
SciCode	22.00	Top 82%	Scientific
MMLU-Pro	74.20	Top 64%	General Knowledge
LiveCodeBench	72.40	Top 24%	Programming
AA Math Index	69.70	Top 38%	Mathematics
AA Coding Index	8.30	Top 91%	Programming
AAII	14.80	Top 72%	General

GPQA Score: 57.00 - Graduate-level reasoning benchmark

Model Comparison:

Provider: deepinfra

Model Type: completions

Context Size: 128000 tokens

Comparing against 348 models in the database

Category Scores

Benchmark Tests

View Other Benchmarks

HLE

4.6

General Knowledge

GPQA

57.0

STEM (Physics, Chemistry, Biology)

SciCode

22.0

Scientific

MMLU-Pro

74.2

General Knowledge

LiveCodeBench

72.4

Programming

AA Math Index

69.7

Mathematics

AA Coding Index

8.3

Programming

AAII

14.8

General

Metric	HLE	GPQA	SciCode	MMLU-Pro	LiveCodeBench	AA Math Index	AA Coding Index	AAII
Score	4.6	57.0	22.0	74.2	72.4	69.7	8.3	14.8

Compare with Similar Models

claude-opus-4.5

claude-opus-4.6

gemini-3-flash-preview

claude-sonnet-4.5

gemini-3-pro-preview

claude-sonnet-4

Code Examples

Integration samples and API usage

Code Samples for nemotron-nano-9b-v2

Python SDK Example:

from openai import OpenAI

client = OpenAI(
    base_url="https://api.langdb.ai/projects/<your_project_id>",
    api_key="<your_api_key>"
)

response = client.chat.completions.create(
    model="nemotron-nano-9b-v2",
    messages=[
        {"role": "user", "content": "Hello, how are you?"}
    ]
)

print(response.choices[0].message.content)

TypeScript SDK Example:

import OpenAI from 'openai';

const client = new OpenAI({
    baseURL: "https://api.langdb.ai/projects/<your_project_id>",
    apiKey: "<your_api_key>"
});

const response = await client.chat.completions.create({
    model: "nemotron-nano-9b-v2",
    messages: [
        { role: "user", content: "Hello, how are you?" }
    ]
});

console.log(response.choices[0].message.content);

cURL Example:

curl -X POST "https://api.langdb.ai/projects/<your_project_id>/v1/chat/completions" \
  -H "Authorization: Bearer <your_api_key>" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "nemotron-nano-9b-v2",
    "messages": [
        {"role": "user", "content": "Hello, how are you?"}
    ]
  }'

Model: nemotron-nano-9b-v2

Provider: deepinfra

API Endpoint: $https://api.langdb.ai

Create API Key

Related Models

Similar models from deepinfra

nemotron-nano-9b-v2

nemotron-nano-9b-v2

Access nemotron-nano-9b-v2 through LangDB AI Gateway

Category Scores

Benchmark Tests

Compare with Similar Models

Code Examples

Related Models

deepseek-chat-v3-0324

deepseek-chat-v3.1

deepseek-prover-v2

DeepSeek-R1

deepseek-r1-0528

DeepSeek-R1-Distill-Llama-70B

nemotron-nano-9b-v2 by deepinfra - AI Model Details, Pricing, and Performance Metrics

nemotron-nano-9b-v2

nemotron-nano-9b-v2

Access nemotron-nano-9b-v2 through LangDB AI Gateway

Category Scores

Benchmark Tests

Compare with Similar Models

Code Examples

Related Models

deepseek-chat-v3-0324

deepseek-chat-v3.1

deepseek-prover-v2

DeepSeek-R1

deepseek-r1-0528

DeepSeek-R1-Distill-Llama-70B