nemotron-3-super-120b-a12b by openrouter - AI Model Details, Pricing, and Performance Metrics

NVIDIA Nemotron 3 Super is a 120B-parameter open hybrid MoE model that activates just 12B parameters per token for maximum compute efficiency and accuracy in complex multi-agent applications. Built on a hybrid Mamba-Transformer Mixture-of-Experts architecture with multi-token prediction (MTP), it delivers over 50% higher token generation throughput than leading open models. The model features a 1M-token context window for long-term agent coherence, cross-document reasoning, and multi-step task planning. Its latent MoE design calls 4 experts for the inference cost of only one, improving intelligence and generalization. Multi-environment RL training across 10+ environments delivers leading accuracy on benchmarks including AIME 2025, TerminalBench, and SWE-Bench Verified. Fully open, with weights, datasets, and recipes released under the NVIDIA Open Model License, Nemotron 3 Super allows easy customization and secure deployment anywhere, from workstation to cloud.

Released: Mar 11, 2026
Knowledge cutoff: Jun 1, 2025
License: NVIDIA Open Model License Agreement (nvidia_open_model_license_agreement)
Context: 262,144 tokens
Input: $0.2 / 1M tokens
Output: $0.7 / 1M tokens
Cached: $0.1 / 1M tokens
Capabilities: tools, reasoning
Accepts: text
Returns: text
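
As a quick illustration of the listed rates, the sketch below estimates the dollar cost of a single request. The per-million-token prices come from the pricing above; the token counts and the assumption that cached tokens are billed in place of regular input tokens are made-up example values, not part of the listing.

```python
# Rough per-request cost estimate using the listed rates.
# Token counts below are illustrative; cached tokens are assumed to be
# a subset of input tokens billed at the cached rate.

INPUT_PRICE_PER_M = 0.2    # $ per 1M input tokens
OUTPUT_PRICE_PER_M = 0.7   # $ per 1M output tokens
CACHED_PRICE_PER_M = 0.1   # $ per 1M cached input tokens

def estimate_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> float:
    """Return the estimated cost in dollars for one request."""
    uncached_input = max(input_tokens - cached_tokens, 0)
    return (
        uncached_input * INPUT_PRICE_PER_M
        + cached_tokens * CACHED_PRICE_PER_M
        + output_tokens * OUTPUT_PRICE_PER_M
    ) / 1_000_000

# Example: a 200k-token prompt (50k of it cached) producing a 4k-token answer.
print(f"${estimate_cost(200_000, 4_000, cached_tokens=50_000):.4f}")  # $0.0378
```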

Access nemotron-3-super-120b-a12b through LangDB AI Gateway

Recommended

Integrate with NVIDIA's nemotron-3-super-120b-a12b and 250+ other models through a unified API. Monitor usage, control costs, and enhance security.

Unified API
Cost Optimization
Enterprise Security
Get Started Now

Free tier available • No credit card required

Instant Setup
99.9% Uptime
10,000+ Monthly Requests

Benchmark Tests

HLE: 19.2 (General Knowledge)
GPQA: 81.3 (STEM: Physics, Chemistry, Biology)
SciCode: 36.0 (Scientific)
MMLU-Pro: 83.7 (General Knowledge)
AA Coding Index: 31.2 (Programming)
AAII: 36.0 (General)

Code Examples

Integration samples and API usage
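
A minimal sketch of calling the model through an OpenAI-compatible Chat Completions endpoint, as exposed by gateways such as LangDB or OpenRouter. The base URL, API-key environment variable, and model identifier format are assumptions; substitute the values from your gateway dashboard.

```python
# Minimal sketch: call nemotron-3-super-120b-a12b via an
# OpenAI-compatible Chat Completions endpoint.
# NOTE: base_url, the API-key environment variable, and the model id
# are assumptions; replace them with the values your gateway provides.
import os

from openai import OpenAI

client = OpenAI(
    base_url="https://openrouter.ai/api/v1",    # assumed gateway endpoint
    api_key=os.environ["OPENROUTER_API_KEY"],   # assumed env var name
)

response = client.chat.completions.create(
    model="nvidia/nemotron-3-super-120b-a12b",  # assumed model identifier
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Summarize the trade-offs of Mixture-of-Experts models."},
    ],
    max_tokens=512,
)

print(response.choices[0].message.content)
```

Since the listing marks the model's capabilities as tools and reasoning, the same endpoint should also accept a `tools` parameter in the standard OpenAI function-calling format.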