llama-3.3-70b-instruct

deepinfra • Performance Analytics

Core Performance Metrics

Total Requests

110

100.0%

Error Rate

0.00%

0.0%

Total Input Tokens

42,632

Total Output Tokens

3,740

Access llama-3.3-70b-instruct through LangDB AI Gateway

Recommended

Integrate with meta's llama-3.3-70b-instruct and 250+ other models through a unified API. Monitor usage, control costs, and enhance security.

Unified API

Cost Optimization

Enterprise Security

Get Started Now

Free tier available • No credit card required

Instant Setup

99.9% Uptime

10,000+Monthly Requests

Performance Percentiles

Response Time

1.90s

100.0%

TTFT

1.90s

TPS (Tokens/Second)

221.6 TPS

100.0%

TPOT (Time/Output Token)

0.060ms

Performance Analytics for llama-3.3-70b-instruct

Usage Statistics (Last 1 Days):

Total Requests: 110 API calls
Average TPS: 221.62 tokens per second
Average Response Time: 1902.20ms
Average Time to First Token: 1902.20ms
Total Cost: $0.00
Average Request Cost: $0.0000

Daily Performance Breakdown:

Date	Requests	TPS	Response Time	TTFT	Cost
11/12/2025	110	221.62	1902.20ms	1902.20ms	$0.00

Performance Summary:

Model: llama-3.3-70b-instruct by deepinfra

Monitoring Period: 11/11/2025 to 11/18/2025

Average Daily Requests: 110

Peak Daily Requests: 110

Performance Trends

Nov 11 - Nov 18, 2025

Request Volume

Daily API requests

110

Performance (TPS)

Tokens per second

221.62 tokens/s

Response Time

Average response latency (ms)

1902.20 ms

TTFT

Time to First Token (ms)

1902.20 ms

Token Analytics

Token usage distribution and efficiency metrics

Token Distribution

Input vs Output token usage

Input Tokens:42,632

Output Tokens:3,740

Total Tokens:46,372

Token Usage Timeline

Daily token consumption trends

llama-3.3-70b-instruct Performance Analytics - Real-time Metrics, Token Usage & Cost Analysis

llama-3.3-70b-instruct

Core Performance Metrics

Access llama-3.3-70b-instruct through LangDB AI Gateway

Performance Percentiles

Performance Trends

Token Analytics