deepseek-r1-distill-llama-8b

openrouter • Performance Analytics

Core Performance Metrics

Total Requests

200.0%

Error Rate

0.00%

0.0%

Total Input Tokens

29,827

537.7%

Total Output Tokens

9,858

357.9%

Access deepseek-r1-distill-llama-8b through LangDB AI Gateway

Recommended

Integrate with deepseek's deepseek-r1-distill-llama-8b and 250+ other models through a unified API. Monitor usage, control costs, and enhance security.

Unified API

Cost Optimization

Enterprise Security

Get Started Now

Free tier available • No credit card required

Instant Setup

99.9% Uptime

10,000+Monthly Requests

Performance Percentiles

Response Time

21.17s

50.5%

TTFT

3.46s

200.7%

TPS (Tokens/Second)

208.3 TPS

28.7%

TPOT (Time/Output Token)

0.020ms

Performance Analytics for deepseek-r1-distill-llama-8b

Usage Statistics (Last 4 Days):

Total Requests: 9 API calls
Average TPS: 208.31 tokens per second
Average Response Time: 21167.50ms
Average Time to First Token: 3457.90ms
Total Cost: $0.00
Average Request Cost: $0.0002

Daily Performance Breakdown:

Date	Requests	TPS	Response Time	TTFT	Cost
9/4/2025	1	451.17	20588.60ms	1676.90ms	$0.00
9/5/2025	1	199.91	25641.50ms	1119.00ms	$0.00
9/6/2025	2	193.45	24763.90ms	1297.70ms	$0.00
9/10/2025	5	165.58	18949.90ms	5146.00ms	$0.00

Performance Summary:

Model: deepseek-r1-distill-llama-8b by openrouter

Monitoring Period: 9/4/2025 to 9/11/2025

Average Daily Requests: 2

Peak Daily Requests: 5

Performance Trends

Sep 4 - Sep 11, 2025

Request Volume

Daily API requests

Performance (TPS)

Tokens per second

208.31 tokens/s

Response Time

Average response latency (ms)

21167.50 ms

TTFT

Time to First Token (ms)

3457.90 ms

Token Analytics

Token usage distribution and efficiency metrics

Token Distribution

Input vs Output token usage

Input Tokens:29,827

Output Tokens:9,858

Total Tokens:39,685

Token Usage Timeline