gpt-oss-120b

deepinfra • Performance Analytics

Core Performance Metrics

Total Requests

288.9%

Error Rate

0.00%

0.0%

Total Input Tokens

14,993

2928.9%

Total Output Tokens

14,749

3812.2%

Access gpt-oss-120b through LangDB AI Gateway

Recommended

Integrate with openai's gpt-oss-120b and 250+ other models through a unified API. Monitor usage, control costs, and enhance security.

Unified API

Cost Optimization

Enterprise Security

Get Started Now

Free tier available • No credit card required

Instant Setup

99.9% Uptime

10,000+Monthly Requests

Performance Percentiles

Response Time

6.94s

941.9%

TTFT

6.94s

935.8%

TPS (Tokens/Second)

122.5 TPS

15.8%

TPOT (Time/Output Token)

0.020ms

Performance Analytics for gpt-oss-120b

Usage Statistics (Last 3 Days):

Total Requests: 35 API calls
Average TPS: 122.45 tokens per second
Average Response Time: 6939.90ms
Average Time to First Token: 6939.90ms
Total Cost: $0.01
Average Request Cost: $0.0003

Daily Performance Breakdown:

Date	Requests	TPS	Response Time	TTFT	Cost
11/11/2025	1	1774.67	2923.40ms	2923.40ms	$0.00
11/12/2025	4	165.87	4903.00ms	4903.00ms	$0.00
11/17/2025	30	96.66	7345.30ms	7345.30ms	$0.01

Performance Summary:

Model: gpt-oss-120b by deepinfra

Monitoring Period: 11/11/2025 to 11/18/2025

Average Daily Requests: 12

Peak Daily Requests: 30

Performance Trends

Nov 11 - Nov 18, 2025

Request Volume

Daily API requests

Performance (TPS)

Tokens per second

122.45 tokens/s

Response Time

Average response latency (ms)

6939.90 ms

TTFT

Time to First Token (ms)

6939.90 ms

Token Analytics

Token usage distribution and efficiency metrics

Token Distribution

Input vs Output token usage

Input Tokens:14,993

Output Tokens:14,749

Total Tokens:29,742

Token Usage Timeline