qwen3-32b by deepinfra - AI Model Details, Pricing, and Performance Metrics

qwen
qwen3-32b
qwen

qwen3-32b

completions
On:deepinfragroqparasail

Qwen3-32B is a dense 32.8B parameter causal language model from the Qwen3 series, optimized for both complex reasoning and efficient dialogue. It supports seamless switching between a "thinking" mode for tasks like math, coding, and logical inference, and a "non-thinking" mode for faster, general-purpose conversation. The model demonstrates strong performance in instruction-following, agent tool use, creative writing, and multilingual tasks across 100+ languages and dialects. It natively handles 32K token contexts and can extend to 131K tokens using YaRN-based scaling.

ProviderInputOutput
deepinfra
deepinfra
$0.1 / 1M tokens$0.3 / 1M tokens
groq
groq
$0.29 / 1M tokens$0.59 / 1M tokens
parasail
parasail
$0.1 / 1M tokens$0.5 / 1M tokens
Released
Apr 29, 2025
Knowledge
Oct 31, 2024
Context
40960
Input
$0.1 / 1M tokens
Output
$0.3 / 1M tokens
Capabilities: tools
Accepts: text
Returns: text

Access qwen3-32b through LangDB AI Gateway

Recommended

Integrate with qwen's qwen3-32b and 250+ other models through a unified API. Monitor usage, control costs, and enhance security.

Unified API
Cost Optimization
Enterprise Security
Get Started Now

Free tier available • No credit card required

Instant Setup
99.9% Uptime
10,000+Monthly Requests
Available from 3 providers
Provider:

Category Scores

Benchmark Tests

View Other Benchmarks
AIME
80.7
Mathematics
AA Coding Index
30.9
Programming
AAII
38.7
General
AA Math Index
73.0
Mathematics
GPQA
66.8
STEM (Physics, Chemistry, Biology)
HLE
8.3
General Knowledge
LiveCodeBench
54.6
Programming
MATH-500
96.1
Mathematics
MMLU-Pro
79.8
General Knowledge
SciCode
35.4
Scientific

Code Examples

Integration samples and API usage