DeepSeek-R1-Distill-Llama-70B

togetherai • Performance Analytics

Core Performance Metrics

Total Requests

1600.0%

Error Rate

0.00%

0.0%

Total Input Tokens

70,324

798.4%

Total Output Tokens

6,414

1011.6%

Access DeepSeek-R1-Distill-Llama-70B through LangDB AI Gateway

Recommended

Integrate with deepseek's DeepSeek-R1-Distill-Llama-70B and 250+ other models through a unified API. Monitor usage, control costs, and enhance security.

Unified API

Cost Optimization

Enterprise Security

Get Started Now

Free tier available • No credit card required

Instant Setup

99.9% Uptime

10,000+Monthly Requests

Performance Percentiles

Response Time

12.48s

53.7%

TTFT

12.48s

53.7%

TPS (Tokens/Second)

361.6 TPS

65.1%

TPOT (Time/Output Token)

0.030ms

Performance Analytics for DeepSeek-R1-Distill-Llama-70B

Usage Statistics (Last 1 Days):

Total Requests: 17 API calls
Average TPS: 361.61 tokens per second
Average Response Time: 12483.20ms
Average Time to First Token: 12483.20ms
Total Cost: $0.18
Average Request Cost: $0.0108

Daily Performance Breakdown:

Date	Requests	TPS	Response Time	TTFT	Cost
11/11/2025	17	361.61	12483.20ms	12483.20ms	$0.18

Performance Summary:

Model: DeepSeek-R1-Distill-Llama-70B by togetherai

Monitoring Period: 11/11/2025 to 11/18/2025

Average Daily Requests: 17

Peak Daily Requests: 17

Performance Trends

Nov 11 - Nov 18, 2025

Request Volume

Daily API requests

Performance (TPS)

Tokens per second

361.61 tokens/s

Response Time

Average response latency (ms)

12483.20 ms

TTFT

Time to First Token (ms)

12483.20 ms

Token Analytics

Token usage distribution and efficiency metrics

Token Distribution

Input vs Output token usage

Input Tokens:70,324

Output Tokens:6,414

Total Tokens:76,738

Token Usage Timeline