qwen3-32b by deepinfra - AI Model Details, Pricing, and Performance Metrics

qwen
qwen3-32b
qwen

qwen3-32b

completions
On:deepinfraparasail

Qwen3-32B is a dense 32.8B parameter causal language model from the Qwen3 series, optimized for both complex reasoning and efficient dialogue. It supports seamless switching between a "thinking" mode for tasks like math, coding, and logical inference, and a "non-thinking" mode for faster, general-purpose conversation. The model demonstrates strong performance in instruction-following, agent tool use, creative writing, and multilingual tasks across 100+ languages and dialects. It natively handles 32K token contexts and can extend to 131K tokens using YaRN-based scaling.

ProviderInputOutput
deepinfra
deepinfra
$0.1 / 1M tokens$0.3 / 1M tokens
parasail
parasail
$0.1 / 1M tokens$0.5 / 1M tokens
Released
Apr 29, 2025
Knowledge
Oct 31, 2024
Context
40960
Input
$0.1 / 1M tokens
Output
$0.3 / 1M tokens
Capabilities: tools
Accepts: text
Returns: text

Access qwen3-32b through LangDB AI Gateway

Recommended

Integrate with qwen's qwen3-32b and 250+ other models through a unified API. Monitor usage, control costs, and enhance security.

Unified API
Cost Optimization
Enterprise Security
Get Started Now

Free tier available • No credit card required

Instant Setup
99.9% Uptime
10,000+Monthly Requests
Available from 2 providers
Provider:

Category Scores

Benchmark Tests

View Other Benchmarks
AIME
30.3
Mathematics
AA Coding Index
28.4
Programming
AAII
26.4
General
AA Math Index
19.7
Mathematics
GPQA
53.5
STEM (Physics, Chemistry, Biology)
HLE
4.3
General Knowledge
LiveCodeBench
28.8
Programming
MATH-500
86.9
Mathematics
MMLU-Pro
72.7
General Knowledge
SciCode
28.0
Scientific

Code Examples

Integration samples and API usage