qwen3-coder by deepinfra - AI Model Details, Pricing, and Performance Metrics

qwen
qwen3-coder
qwen

qwen3-coder

completions
On:deepinfraparasail

Qwen3-Coder-480B-A35B-Instruct is a Mixture-of-Experts (MoE) code generation model developed by the Qwen team. It is optimized for agentic coding tasks such as function calling, tool use, and long-context reasoning over repositories. The model features 480 billion total parameters, with 35 billion active per forward pass (8 out of 160 experts). Pricing for the Alibaba endpoints varies by context length. Once a request is greater than 128k input tokens, the higher pricing is used.

ProviderInputOutputCached
deepinfra
deepinfra
$0.3 / 1M tokens$1.2 / 1M tokens$0.15 / 1M tokens
parasail
parasail
$0.39 / 1M tokens$1.6 / 1M tokens-
Released
Jul 22, 2025
Knowledge
Jan 23, 2025
Context
262144
Input
$0.3 / 1M tokens
Output
$1.2 / 1M tokens
Cached
$0.15 / 1M tokens
Capabilities: tools
Accepts: text
Returns: text

Access qwen3-coder through LangDB AI Gateway

Recommended

Integrate with qwen's qwen3-coder and 250+ other models through a unified API. Monitor usage, control costs, and enhance security.

Unified API
Cost Optimization
Enterprise Security
Get Started Now

Free tier available • No credit card required

Instant Setup
99.9% Uptime
10,000+Monthly Requests
Available from 2 providers
Provider:

Category Scores

Benchmark Tests

View Other Benchmarks
AIME
47.7
Mathematics
AA Coding Index
47.2
Programming
AAII
42.3
General
AA Math Index
39.3
Mathematics
GPQA
61.8
STEM (Physics, Chemistry, Biology)
HLE
4.4
General Knowledge
LiveCodeBench
58.5
Programming
MATH-500
94.2
Mathematics
MMLU-Pro
78.8
General Knowledge
SciCode
35.9
Scientific

Code Examples

Integration samples and API usage