qwen3-coder

completions

Qwen3-Coder-480B-A35B-Instruct is a Mixture-of-Experts (MoE) code generation model developed by the Qwen team. It is optimized for agentic coding tasks such as function calling, tool use, and long-context reasoning over repositories. The model features 480 billion total parameters, with 35 billion active per forward pass (8 out of 160 experts). Pricing for the Alibaba endpoints varies by context length. Once a request is greater than 128k input tokens, the higher pricing is used.

Input:$0.2 / 1M tokens

Output:$0.8 / 1M tokens

Context:262144 tokens

tools

text

Category Rankings

programming#14

Access qwen3-coder through LangDB AI Gateway

Recommended

Integrate with qwen's qwen3-coder and 250+ other models through a unified API. Monitor usage, control costs, and enhance security.

Unified API

Cost Optimization

Enterprise Security

Get Started Now

Free tier available • No credit card required

Instant Setup

99.9% Uptime

10,000+Monthly Requests

Code Example

Configuration

Base URL

API Keys

Headers

Project ID in header

X-Run-Id

X-Thread-Id

Model Parameters

17 available

frequency_penalty

-202

logit_bias

logprobs

max_tokens

min_p

001

presence_penalty

-201.999

repetition_penalty

012

response_format

seed

stop

structured_outputs

temperature

012

tool_choice

tools

top_k

top_logprobs

top_p

011

Additional Configuration

Tools

Guards

User:

Id:

Name:

Tags:

Publicly Shared Threads0

Discover shared experiences

Shared threads will appear here, showcasing real-world applications and insights from the community. Check back soon for updates!

Share your threads to help others

Popular Models10

deepseek-chat-v3-0324
deepseek
DeepSeek V3, a 685B-parameter, mixture-of-experts model, is the latest iteration of the flagship chat model family from the DeepSeek team. It succeeds the [DeepSeek V3](/deepseek/deepseek-chat-v3) model and performs really well on a variety of tasks.
Input:$0.18 / 1M tokens
Output:$0.72 / 1M tokens
Context:163840 tokens
tools
text
text
deepseek-chat
deepseek
DeepSeek-Chat is an advanced conversational AI model designed to provide intelligent
Input:$0.27 / 1M tokens
Output:$1.1 / 1M tokens
Context:64K tokens
tools
text
text
deepseek-r1-0528
deepseek
May 28th update to the [original DeepSeek R1](/deepseek/deepseek-r1) Performance on par with [OpenAI o1](/openai/o1), but open-sourced and with fully open reasoning tokens. It's 671B parameters in size, with 37B active in an inference pass. Fully open-source model.
Input:$0.18 / 1M tokens
Output:$0.72 / 1M tokens
Context:163840 tokens
text
text
gpt-4o-mini
openai
GPT-4o mini (o for omni) is a fast, affordable small model for focused tasks. It accepts both text and image inputs, and produces text outputs (including Structured Outputs). It is ideal for fine-tuning, and model outputs from a larger model like GPT-4o can be distilled to GPT-4o-mini to produce similar results at lower cost and latency.The knowledge cutoff for GPT-4o-mini models is October, 2023.
Input:$0.15 / 1M tokens
Output:$0.6 / 1M tokens
Context:128K tokens
tools
text
image
text
qwen3-235b-a22b-2507
qwen
Qwen3-235B-A22B-Instruct-2507 is a multilingual, instruction-tuned mixture-of-experts language model based on the Qwen3-235B architecture, with 22B active parameters per forward pass. It is optimized for general-purpose text generation, including instruction following, logical reasoning, math, code, and tool usage. The model supports a native 262K context length and does not implement "thinking mode" (<think> blocks). Compared to its base variant, this version delivers significant gains in knowledge coverage, long-context reasoning, coding benchmarks, and alignment with open-ended tasks. It is particularly strong on multilingual understanding, math reasoning (e.g., AIME, HMMT), and alignment evaluations like Arena-Hard and WritingBench.
Input:$0.08 / 1M tokens
Output:$0.31 / 1M tokens
Context:262144 tokens
text
text
claude-sonnet-4
anthropic
Our high-performance model with exceptional reasoning and efficiency
Input:$3 / 1M tokens
Output:$15 / 1M tokens
Context:200K tokens
tools
text
image
text
claude-opus-4
anthropic
Our most capable and intelligent model yet. Claude Opus 4 sets new standards in complex reasoning and advanced coding
Input:$15 / 1M tokens
Output:$75 / 1M tokens
Context:200K tokens
tools
text
image
text
gemini-2.5-pro
gemini
Gemini 2.5 Pro is our most advanced reasoning Gemini model, capable of solving complex problems.
Input:$1.25 / 1M tokens
Output:$10 / 1M tokens
Context:1M tokens
tools
text
image
audio
video
text
gpt-4.1
openai
GPT-4.1 is OpenAI's flagship model for complex tasks. It is well suited for problem solving across domains.
Input:$2 / 1M tokens
Output:$8 / 1M tokens
Context:1047576 tokens
tools
text
image
text
gemini-2.5-pro-preview
gemini
Gemini 2.5 Pro Experimental is Google's state-of-the-art thinking model, capable of reasoning over complex problems in code, math, and STEM, as well as analyzing large datasets, codebases, and documents using long context.
Input:$1.25 / 1M tokens
Output:$10 / 1M tokens
Context:1M tokens
tools
text
image
audio
video
text