DeepSeek-R1-Distill-Qwen-32B

deepinfra • Performance Analytics

Core Performance Metrics

Total Requests: 5 (100.0%)
Error Rate: 0.00% (0.0%)
Total Input Tokens: 68,886
Total Output Tokens: 2,291

Access DeepSeek-R1-Distill-Qwen-32B through LangDB AI Gateway

Recommended

Integrate with DeepSeek's DeepSeek-R1-Distill-Qwen-32B and 250+ other models through a unified API. Monitor usage, control costs, and enhance security.

Unified API

Cost Optimization

Enterprise Security


Performance Percentiles

Response Time: 10.96s (100.0%)
TTFT: 9.61s
TPS (Tokens/Second): 1298.5 (100.0%)
TPOT (Time/Output Token): 0.020ms

Performance Analytics for DeepSeek-R1-Distill-Qwen-32B

Usage Statistics (Last 2 Days):

Total Requests: 5 API calls
Average TPS: 1298.45 tokens per second
Average Response Time: 10963.40ms
Average Time to First Token: 9614.80ms
Total Cost: $0.01
Average Request Cost: $0.0021
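
As a quick consistency check on the two cost figures above, the sketch below multiplies the average request cost by the request count and rounds to cents. It assumes the displayed total is simply rounded to two decimal places, which the page does not state. Because the average is itself rounded, the result is only an estimate.

```python
from decimal import Decimal, ROUND_HALF_UP

# Figures copied from the usage statistics above.
requests = 5
avg_request_cost = Decimal("0.0021")  # dollars, already rounded by the dashboard

# Estimated total before display rounding.
estimated_total = avg_request_cost * requests

# Assumption: the dashboard rounds the total to whole cents.
displayed_total = estimated_total.quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)

print(f"estimated total: ${estimated_total}")
print(f"rounded to cents: ${displayed_total}")
```

The estimate of roughly $0.0105 is consistent with the reported Total Cost of $0.01.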

Daily Performance Breakdown:

Date	Requests	TPS	Response Time	TTFT	Cost
9/21/2025	2	4372.74	6275.20ms	2903.90ms	$0.01
9/22/2025	3	385.58	14088.80ms	14088.80ms	$0.00
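
The overall response time and TTFT averages can be reproduced from the two daily rows by weighting each day by its request count. The sketch below is an illustration of that arithmetic, not the dashboard's documented method; the helper name `weighted_mean` is hypothetical.

```python
# Daily rows copied from the table above: (date, requests, response_ms, ttft_ms)
daily = [
    ("9/21/2025", 2, 6275.20, 2903.90),
    ("9/22/2025", 3, 14088.80, 14088.80),
]

def weighted_mean(rows, value_index):
    """Average a per-day metric, weighting each day by its request count."""
    total_requests = sum(row[1] for row in rows)
    return sum(row[1] * row[value_index] for row in rows) / total_requests

avg_response_ms = weighted_mean(daily, 2)
avg_ttft_ms = weighted_mean(daily, 3)

print(f"response time: {avg_response_ms:.2f} ms")
print(f"TTFT:          {avg_ttft_ms:.2f} ms")
```

The results land within 0.05 ms of the reported averages (10963.40 ms and 9614.80 ms), so the small gap is plausibly rounding in the displayed daily values. The same weighting applied to the daily TPS column gives about 1980 tokens per second rather than the reported 1298.45, so the overall TPS appears to be computed differently.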

Performance Summary:

Model: DeepSeek-R1-Distill-Qwen-32B (provider: deepinfra)

Monitoring Period: 9/16/2025 to 9/23/2025

Average Daily Requests: 3

Peak Daily Requests: 3

Performance Trends

Sep 16 - Sep 23, 2025

Request Volume (daily API requests)
Performance (TPS): 1298.45 tokens/s
Response Time (average response latency): 10963.40 ms
TTFT (Time to First Token): 9614.80 ms

Token Analytics

Token usage distribution and efficiency metrics

Token Distribution

Input vs Output token usage

Input Tokens: 68,886
Output Tokens: 2,291
Total Tokens: 71,177
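
The token totals can be cross-checked against the throughput figure reported elsewhere on this page. The sketch below sums input and output tokens, computes the input share, and divides total tokens by total response time (5 requests at an average of 10963.40 ms). That this ratio matches the reported 1298.45 tokens per second is an observation from the arithmetic; the page does not say how TPS is defined, so treat the formula as an assumption.

```python
# Figures copied from this page.
input_tokens = 68_886
output_tokens = 2_291
requests = 5
avg_response_ms = 10963.40

total_tokens = input_tokens + output_tokens
input_share = input_tokens / total_tokens

# Assumed definition: all tokens (input + output) over total wall-clock time.
total_seconds = requests * avg_response_ms / 1000
tokens_per_second = total_tokens / total_seconds

print(f"total tokens: {total_tokens:,}")
print(f"input share:  {input_share:.1%}")
print(f"throughput:   {tokens_per_second:.2f} tokens/s")
```

If this reading is right, the TPS figure is dominated by input tokens, which make up roughly 97% of the total, and it should not be read as generation speed alone.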

Token Usage Timeline

Daily token consumption trends
