qwen3-235b-a22b-thinking-2507 by qwen - AI Model Details, Pricing, and Performance Metrics

qwen / qwen3-235b-a22b-thinking-2507
Type: completions
Available on: deepinfra, parasail

Qwen3-235B-A22B-Thinking-2507 is a high-performance, open-weight Mixture-of-Experts (MoE) language model optimized for complex reasoning tasks. It activates 22B of its 235B parameters per forward pass and natively supports up to 262,144 tokens of context. This "thinking-only" variant enhances structured logical reasoning, mathematics, science, and long-form generation, showing strong benchmark performance across AIME, SuperGPQA, LiveCodeBench, and MMLU-Redux. It enforces a dedicated reasoning mode, emitting a reasoning trace terminated by a closing </think> tag, and is designed for long outputs (up to 81,920 tokens) in challenging domains. The model is instruction-tuned and excels at step-by-step reasoning, tool use, agentic workflows, and multilingual tasks. This release is the most capable open-weight variant in the Qwen3-235B series, surpassing many closed models on structured reasoning use cases.
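Because the reasoning trace is delimited by the closing </think> tag, downstream code typically splits it from the final answer. A minimal Python sketch, assuming the raw completion string contains the tag (exact tag handling depends on the serving stack; the sample string is hypothetical):

```python
def split_reasoning(completion: str) -> tuple[str, str]:
    """Split a thinking-mode completion into (reasoning, answer).

    Assumes the reasoning trace is terminated by a closing </think>
    tag, as described above. Some deployments strip the opening
    <think> tag, so only the closing tag is checked here.
    """
    marker = "</think>"
    if marker in completion:
        reasoning, answer = completion.split(marker, 1)
        return reasoning.strip(), answer.strip()
    # No tag found: treat the whole output as the final answer.
    return "", completion.strip()

# Hypothetical completion string for illustration:
raw = "Let me check: 17 has no divisors besides 1 and itself...</think>Yes, 17 is prime."
reasoning, answer = split_reasoning(raw)
print(answer)  # -> "Yes, 17 is prime."
```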

Provider  | Input             | Output
----------|-------------------|------------------
deepinfra | $0.13 / 1M tokens | $0.60 / 1M tokens
parasail  | $0.65 / 1M tokens | $3.00 / 1M tokens
Released: Apr 29, 2025
Knowledge cutoff: Oct 31, 2024
Context: 262,144 tokens
Input: $0.13 / 1M tokens (deepinfra)
Output: $0.60 / 1M tokens (deepinfra)
Accepts: text
Returns: text
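Per-request cost is input tokens times the input rate plus output tokens times the output rate. A small sketch using the deepinfra rates listed above (the token counts are illustrative):

```python
# Per-million-token rates for this model on deepinfra (from the table above).
INPUT_RATE = 0.13   # USD per 1M input tokens
OUTPUT_RATE = 0.60  # USD per 1M output tokens

def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the USD cost of a single request at the listed rates."""
    return input_tokens / 1e6 * INPUT_RATE + output_tokens / 1e6 * OUTPUT_RATE

# A long reasoning request: 10k-token prompt, 30k tokens of thinking + answer.
print(f"${request_cost(10_000, 30_000):.4f}")  # -> $0.0193
```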

Access qwen3-235b-a22b-thinking-2507 through LangDB AI Gateway

Integrate with qwen's qwen3-235b-a22b-thinking-2507 and 250+ other models through a unified API. Monitor usage, control costs, and enhance security.

Unified API • Cost Optimization • Enterprise Security
Free tier available (no credit card required): instant setup, 99.9% uptime, 10,000+ monthly requests.
Available from 2 providers: deepinfra and parasail.

Benchmark Tests

Benchmark | Score | Category
----------|-------|-----------------------------------
GPQA      | 81.1  | STEM (Physics, Chemistry, Biology)
MMLU-Pro  | 84.4  | General Knowledge

Code Examples

Integration samples and API usage
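No samples were attached to this section; below is a minimal sketch of a chat-completion call, assuming deepinfra's OpenAI-compatible endpoint (https://api.deepinfra.com/v1/openai) and the model ID Qwen/Qwen3-235B-A22B-Thinking-2507 — check your provider's docs for the exact base URL, model ID, and API key. The same code can target parasail or the LangDB gateway by swapping base_url and model.

```python
# Minimal chat-completion call, assuming deepinfra's OpenAI-compatible
# endpoint; the base URL and model ID below are assumptions, not
# confirmed by this page.
from openai import OpenAI

client = OpenAI(
    base_url="https://api.deepinfra.com/v1/openai",  # assumed endpoint
    api_key="YOUR_API_KEY",                          # placeholder
)

response = client.chat.completions.create(
    model="Qwen/Qwen3-235B-A22B-Thinking-2507",  # assumed model ID
    messages=[{"role": "user", "content": "Prove that sqrt(2) is irrational."}],
    max_tokens=4096,  # thinking models emit long traces; budget generously
)
print(response.choices[0].message.content)
```

Since this variant spends many output tokens on the reasoning trace before the answer, set max_tokens well above what the final answer alone would need.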