qwen3-32b

completions

bydeepinfra

qwen3-32b

completions

Published by: qwenProvider:

deepinfra

Qwen3-32B is a dense 32.8B parameter causal language model from the Qwen3 series, optimized for both complex reasoning and efficient dialogue. It supports seamless switching between a "thinking" mode for tasks like math, coding, and logical inference, and a "non-thinking" mode for faster, general-purpose conversation. The model demonstrates strong performance in instruction-following, agent tool use, creative writing, and multilingual tasks across 100+ languages and dialects. It natively handles 32K token contexts and can extend to 131K tokens using YaRN-based scaling.

Released

Apr 29, 2025

Knowledge

Oct 31, 2024

License

Apache-2.0

Context

40960

Input

$0.1 / 1M tokens

Output

$0.3 / 1M tokens

Capabilities: tools

Accepts: text

Returns: text

Released Apr 29, 2025Knowledge Cutoff: Oct 31, 2024License: Apache-2.0

Context: 40960 Input: $0.1 / 1M tokensOutput: $0.3 / 1M tokensCapabilities: toolsAccepts: textReturns: text

Access qwen3-32b through LangDB AI Gateway

Recommended

Integrate with qwen's qwen3-32b and 250+ other models through a unified API. Monitor usage, control costs, and enhance security.

Unified API

Cost Optimization

Enterprise Security

Get Started Now

Free tier available • No credit card required

Instant Setup

99.9% Uptime

10,000+Monthly Requests

Benchmark Results for qwen3-32b

Category Performance Scores:

Maths: Score 73.00 (Top 37% - Rank #129)
Finance: Score 44.75 (Top 40% - Rank #140)
Science: Score 65.36 (Top 52% - Rank #181)
Writing: Score 39.88 (Top 75% - Rank #261)
Academia: Score 41.65 (Top 57% - Rank #199)
Marketing: Score 35.83 (Top 80% - Rank #279)
Programming: Score 13.80 (Top 73% - Rank #255)

Overall Performance: 44.89436507936507 average score across all categories

Detailed Benchmark Scores:

Benchmark	Score	Percentile	Domain
HLE	8.30	Top 40%	General Knowledge
AIME	80.70	Top 15%	Mathematics
GPQA	66.80	Top 55%	STEM (Physics, Chemistry, Biology)
SciCode	35.40	Top 44%	Scientific
MATH-500	96.10	Top 21%	Mathematics
MMLU-Pro	79.80	Top 44%	General Knowledge
LiveCodeBench	54.60	Top 41%	Programming
AA Math Index	73.00	Top 37%	Mathematics
AA Coding Index	13.80	Top 73%	Programming
AAII	16.50	Top 61%	General

GPQA Score: 66.80 - Graduate-level reasoning benchmark

Model Comparison:

Provider: deepinfra

Model Type: completions

Context Size: 40960 tokens

Comparing against 348 models in the database

Category Scores

Benchmark Tests

View Other Benchmarks

HLE

8.3

General Knowledge

AIME

80.7

Mathematics

GPQA

66.8

STEM (Physics, Chemistry, Biology)

SciCode

35.4

Scientific

MATH-500

96.1

Mathematics

MMLU-Pro

79.8

General Knowledge

LiveCodeBench

54.6

Programming

AA Math Index

73.0

Mathematics

AA Coding Index

13.8

Programming

AAII

16.5

General

Metric	HLE	AIME	GPQA	SciCode	MATH-500	MMLU-Pro	LiveCodeBench	AA Math Index	AA Coding Index	AAII
Score	8.3	80.7	66.8	35.4	96.1	79.8	54.6	73.0	13.8	16.5

Compare with Similar Models

claude-opus-4.5

claude-opus-4.6

gemini-3-flash-preview

claude-sonnet-4.5

gemini-3-pro-preview

claude-sonnet-4

Code Examples

Integration samples and API usage

Code Samples for qwen3-32b

Python SDK Example:

from openai import OpenAI

client = OpenAI(
    base_url="https://api.langdb.ai/projects/<your_project_id>",
    api_key="<your_api_key>"
)

response = client.chat.completions.create(
    model="qwen3-32b",
    messages=[
        {"role": "user", "content": "Hello, how are you?"}
    ]
)

print(response.choices[0].message.content)

TypeScript SDK Example:

import OpenAI from 'openai';

const client = new OpenAI({
    baseURL: "https://api.langdb.ai/projects/<your_project_id>",
    apiKey: "<your_api_key>"
});

const response = await client.chat.completions.create({
    model: "qwen3-32b",
    messages: [
        { role: "user", content: "Hello, how are you?" }
    ]
});

console.log(response.choices[0].message.content);

cURL Example:

curl -X POST "https://api.langdb.ai/projects/<your_project_id>/v1/chat/completions" \
  -H "Authorization: Bearer <your_api_key>" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "qwen3-32b",
    "messages": [
        {"role": "user", "content": "Hello, how are you?"}
    ]
  }'

Model: qwen3-32b

Provider: deepinfra

API Endpoint: $https://api.langdb.ai

Create API Key

Related Models

Similar models from deepinfra

qwen3-32b

qwen3-32b

Access qwen3-32b through LangDB AI Gateway

Category Scores

Benchmark Tests

Compare with Similar Models

Code Examples

Related Models

deepseek-chat-v3-0324

deepseek-chat-v3.1

deepseek-prover-v2

DeepSeek-R1

deepseek-r1-0528

DeepSeek-R1-Distill-Llama-70B

qwen3-32b by deepinfra - AI Model Details, Pricing, and Performance Metrics

qwen3-32b

qwen3-32b

Access qwen3-32b through LangDB AI Gateway

Category Scores

Benchmark Tests

Compare with Similar Models

Code Examples

Related Models

deepseek-chat-v3-0324

deepseek-chat-v3.1

deepseek-prover-v2

DeepSeek-R1

deepseek-r1-0528

DeepSeek-R1-Distill-Llama-70B