qwen3-235b-a22b-thinking-2507

deepinfra • Performance Analytics

Core Performance Metrics

Total Requests

27 (100.0%)

Error Rate

0.00%

Total Input Tokens

323,800

Total Output Tokens

35,374

Performance Percentiles

Response Time

53.07s

TTFT

0.98s

TPS (Tokens/Second)

250.7 TPS

TPOT (Time/Output Token)

0.040s

Performance Analytics for qwen3-235b-a22b-thinking-2507

Usage Statistics (Last 4 Days):

Total Requests: 27 API calls
Average TPS: 250.66 tokens per second
Average Response Time: 53071.60ms
Average Time to First Token: 984.40ms
Total Cost: $0.08
Average Request Cost: $0.0028
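The total and per-request cost figures are both rounded, so they do not divide exactly. A minimal sketch of a consistency check, assuming the total is rounded to whole cents and the average to four decimal places (the variable names are illustrative, not part of any dashboard API):

```python
# Figures copied from the usage statistics above.
total_requests = 27
avg_request_cost = 0.0028   # USD, as reported (rounded)
reported_total = 0.08       # USD, as reported (rounded to cents)

# Total implied by the per-request average, before rounding.
implied_total = total_requests * avg_request_cost

# Assumption: the dashboard rounds the total to two decimal places.
totals_agree = round(implied_total, 2) == reported_total

print(f"implied total: ${implied_total:.4f}, agrees with report: {totals_agree}")
```

Under that rounding assumption, the two reported figures are consistent with each other.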

Daily Performance Breakdown:

Date	Requests	TPS	Response Time	TTFT	Cost
11/12/2025	17	326.08	55114.30ms	758.50ms	$0.06
11/13/2025	3	447.87	5885.70ms	491.10ms	$0.00
11/14/2025	5	108.29	40124.00ms	1782.20ms	$0.01
11/17/2025	2	86.52	138856.10ms	1650.20ms	$0.01
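The period averages for response time and TTFT can be recovered from the daily rows by weighting each day by its request count. The sketch below assumes that is how the aggregates were formed; the row values are copied from the table, and the helper name is illustrative.

```python
# Daily rows from the breakdown above:
# (date, requests, response_time_ms, ttft_ms)
daily = [
    ("11/12/2025", 17, 55114.30, 758.50),
    ("11/13/2025", 3, 5885.70, 491.10),
    ("11/14/2025", 5, 40124.00, 1782.20),
    ("11/17/2025", 2, 138856.10, 1650.20),
]

total_requests = sum(row[1] for row in daily)


def weighted_mean(column):
    """Request-weighted mean of one column (2 = response time, 3 = TTFT)."""
    return sum(row[1] * row[column] for row in daily) / total_requests


avg_response_ms = weighted_mean(2)
avg_ttft_ms = weighted_mean(3)

print(f"requests: {total_requests}")
print(f"avg response time: {avg_response_ms:.1f} ms")
print(f"avg TTFT: {avg_ttft_ms:.1f} ms")
```

Both results land within rounding distance of the reported 53071.60 ms and 984.40 ms. The same weighting applied to the daily TPS column does not reproduce the reported 250.66, which suggests TPS is aggregated differently.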

Performance Summary:

Model: qwen3-235b-a22b-thinking-2507 by deepinfra

Monitoring Period: 11/11/2025 to 11/18/2025

Average Daily Requests: 7

Peak Daily Requests: 17

Performance Trends (Nov 11 - Nov 18, 2025)

Request Volume: daily API requests
Performance (TPS): 250.66 tokens/s
Response Time: average response latency, 53071.60 ms
TTFT: time to first token, 984.40 ms

Token Analytics

Token usage distribution and efficiency metrics

Token Distribution

Input vs Output token usage

Input Tokens: 323,800

Output Tokens: 35,374

Total Tokens: 359,174
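The token totals can be combined with the latency averages to see how the throughput figures relate. This is a sketch under two assumptions not confirmed by the page: that TPS counts input plus output tokens over the full response time, and that time per output token is the post-TTFT time divided by the average output tokens per request.

```python
# Aggregates copied from the page.
input_tokens = 323_800
output_tokens = 35_374
total_requests = 27
avg_response_ms = 53071.60
avg_ttft_ms = 984.40

total_tokens = input_tokens + output_tokens
tokens_per_request = total_tokens / total_requests
output_share = output_tokens / total_tokens

# Assumption: TPS = (input + output tokens per request) / response time.
tps_estimate = tokens_per_request / (avg_response_ms / 1000)

# Assumption: TPOT = generation time after the first token / output tokens.
output_per_request = output_tokens / total_requests
tpot_ms_estimate = (avg_response_ms - avg_ttft_ms) / output_per_request

print(f"total tokens: {total_tokens:,}")
print(f"output share: {output_share:.1%}")
print(f"estimated TPS: {tps_estimate:.2f}")
print(f"estimated TPOT: {tpot_ms_estimate:.2f} ms per output token")
```

Under the first assumption the estimate matches the reported 250.66 tokens per second, so the TPS figure appears to include input tokens. Under the second, time per output token comes out at roughly 0.04 seconds.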

Token Usage Timeline

Daily token consumption trends
