llama-3.3-nemotron-super-49b-v1.5

completions

byopenrouter

llama-3.3-nemotron-super-49b-v1.5

completions

Published by: nvidiaProvider:

openrouter

Llama-3.3-Nemotron-Super-49B-v1.5 is a 49B-parameter, English-centric reasoning/chat model derived from Meta’s Llama-3.3-70B-Instruct with a 128K context. It’s post-trained for agentic workflows (RAG, tool calling) via SFT across math, code, science, and multi-turn chat, followed by multiple RL stages; Reward-aware Preference Optimization (RPO) for alignment, RL with Verifiable Rewards (RLVR) for step-wise reasoning, and iterative DPO to refine tool-use behavior. A distillation-driven Neural Architecture Search (“Puzzle”) replaces some attention blocks and varies FFN widths to shrink memory footprint and improve throughput, enabling single-GPU (H100/H200) deployment while preserving instruction following and CoT quality. In internal evaluations (NeMo-Skills, up to 16 runs, temp = 0.6, top_p = 0.95), the model reports strong reasoning/coding results, e.g., MATH500 pass@1 = 97.4, AIME-2024 = 87.5, AIME-2025 = 82.71, GPQA = 71.97, LiveCodeBench (24.10–25.02) = 73.58, and MMLU-Pro (CoT) = 79.53. The model targets practical inference efficiency (high tokens/s, reduced VRAM) with Transformers/vLLM support and explicit “reasoning on/off” modes (chat-first defaults, greedy recommended when disabled). Suitable for building agents, assistants, and long-context retrieval systems where balanced accuracy-to-cost and reliable tool use matter.

Context

131072

Input

$0.1 / 1M tokens

Output

$0.4 / 1M tokens

Capabilities: tools, reasoning

Accepts: text

Returns: text

Context: 131072 Input: $0.1 / 1M tokensOutput: $0.4 / 1M tokensCapabilities: tools, reasoningAccepts: textReturns: text

Access llama-3.3-nemotron-super-49b-v1.5 through LangDB AI Gateway

Recommended

Integrate with nvidia's llama-3.3-nemotron-super-49b-v1.5 and 250+ other models through a unified API. Monitor usage, control costs, and enhance security.

Unified API

Cost Optimization

Enterprise Security

Get Started Now

Free tier available • No credit card required

Instant Setup

99.9% Uptime

10,000+Monthly Requests

Code Examples

Integration samples and API usage

Code Samples for llama-3.3-nemotron-super-49b-v1.5

Python SDK Example:

from openai import OpenAI

client = OpenAI(
    base_url="https://api.us-east-1.langdb.ai//projects/<your_project_id>",
    api_key="<your_api_key>"
)

response = client.chat.completions.create(
    model="llama-3.3-nemotron-super-49b-v1.5",
    messages=[
        {"role": "user", "content": "Hello, how are you?"}
    ]
)

print(response.choices[0].message.content)

TypeScript SDK Example:

import OpenAI from 'openai';

const client = new OpenAI({
    baseURL: "https://api.us-east-1.langdb.ai//projects/<your_project_id>",
    apiKey: "<your_api_key>"
});

const response = await client.chat.completions.create({
    model: "llama-3.3-nemotron-super-49b-v1.5",
    messages: [
        { role: "user", content: "Hello, how are you?" }
    ]
});

console.log(response.choices[0].message.content);

cURL Example:

curl -X POST "https://api.us-east-1.langdb.ai//projects/<your_project_id>/v1/chat/completions" \
  -H "Authorization: Bearer <your_api_key>" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "llama-3.3-nemotron-super-49b-v1.5",
    "messages": [
        {"role": "user", "content": "Hello, how are you?"}
    ]
  }'

Model: llama-3.3-nemotron-super-49b-v1.5

Provider: openrouter

API Endpoint: $https://api.us-east-1.langdb.ai/

Create API Key

Related Models

Similar models from openrouter

llama-3.3-nemotron-super-49b-v1.5

llama-3.3-nemotron-super-49b-v1.5

Access llama-3.3-nemotron-super-49b-v1.5 through LangDB AI Gateway

Code Examples

Related Models

afm-4.5b

aion-1.0

aion-1.0-mini

aion-rp-llama-3.1-8b

coder-large

cogito-v2-preview-deepseek-671b

llama-3.3-nemotron-super-49b-v1.5 by openrouter - AI Model Details, Pricing, and Performance Metrics

llama-3.3-nemotron-super-49b-v1.5

llama-3.3-nemotron-super-49b-v1.5

Access llama-3.3-nemotron-super-49b-v1.5 through LangDB AI Gateway

Code Examples

Related Models

afm-4.5b

aion-1.0

aion-1.0-mini

aion-rp-llama-3.1-8b

coder-large

cogito-v2-preview-deepseek-671b