glm-4.5

zai • Performance Analytics

Core Performance Metrics

Total Requests: 350 (127.3%)
Error Rate: 8.57% (100.0%)
Total Input Tokens: 2,051,449 (39.2%)
Total Output Tokens: 173,867 (57.9%)
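As a quick sanity check, the error rate can be converted into an approximate count of failed requests. This is a minimal sketch that assumes the rate is measured over all 350 requests, which the page does not state explicitly; the variable names are illustrative.

```python
# Figures copied from the Core Performance Metrics on this page.
total_requests = 350
error_rate_pct = 8.57  # assumption: failed requests / total requests

# Implied number of failed requests, rounded to a whole request.
failed_requests = round(total_requests * error_rate_pct / 100)
successful_requests = total_requests - failed_requests

print(f"~{failed_requests} failed, ~{successful_requests} succeeded")
```

Under that assumption, 8.57% of 350 works out to about 30 failed requests.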

Performance Percentiles

Response Time: 14.57s (0.2%)
TTFT: 3.90s (63.8%)
TPS (Tokens/Second): 436.2 TPS (38.3%)
TPOT (Time/Output Token): 0.030ms

Performance Analytics for glm-4.5

Usage Statistics (Last 8 Days):

Total Requests: 350 API calls
Average TPS: 436.24 tokens per second
Average Response Time: 14574.60ms
Average Time to First Token: 3898.80ms
Total Cost: $1.91
Average Request Cost: $0.0055

Daily Performance Breakdown:

Date	Requests	TPS	Response Time	TTFT	Cost
9/4/2025	28	463.10	12658.20ms	2426.70ms	$0.14
9/5/2025	21	594.89	12428.90ms	2072.80ms	$0.14
9/6/2025	97	293.37	16628.60ms	3941.50ms	$0.42
9/7/2025	31	519.39	13105.70ms	2144.60ms	$0.19
9/8/2025	15	575.24	12427.80ms	2045.20ms	$0.09
9/9/2025	55	270.86	22536.30ms	9624.80ms	$0.31
9/10/2025	78	705.14	10485.20ms	2437.30ms	$0.47
9/11/2025	25	907.29	8906.80ms	2166.00ms	$0.15
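The headline averages can be cross-checked against the daily rows. The sketch below assumes the reported averages for response time and TTFT are weighted by each day's request count; the page does not say so, but that weighting reproduces the published figures to within rounding. The tuple layout and the `weighted_mean` helper are illustrative, not part of any LangDB API.

```python
# Daily rows copied from the table above:
# (date, requests, tps, response_time_ms, ttft_ms, cost_usd)
daily = [
    ("9/4/2025", 28, 463.10, 12658.20, 2426.70, 0.14),
    ("9/5/2025", 21, 594.89, 12428.90, 2072.80, 0.14),
    ("9/6/2025", 97, 293.37, 16628.60, 3941.50, 0.42),
    ("9/7/2025", 31, 519.39, 13105.70, 2144.60, 0.19),
    ("9/8/2025", 15, 575.24, 12427.80, 2045.20, 0.09),
    ("9/9/2025", 55, 270.86, 22536.30, 9624.80, 0.31),
    ("9/10/2025", 78, 705.14, 10485.20, 2437.30, 0.47),
    ("9/11/2025", 25, 907.29, 8906.80, 2166.00, 0.15),
]

total_requests = sum(row[1] for row in daily)


def weighted_mean(column):
    """Average a per-day column, weighting each day by its request count."""
    return sum(row[1] * row[column] for row in daily) / total_requests


avg_response_ms = weighted_mean(3)  # assumed request-weighted
avg_ttft_ms = weighted_mean(4)      # assumed request-weighted
total_cost = sum(row[5] for row in daily)
avg_request_cost = total_cost / total_requests
peak_day = max(daily, key=lambda row: row[1])

print(f"Total requests:    {total_requests}")
print(f"Avg response time: {avg_response_ms:.2f} ms")
print(f"Avg TTFT:          {avg_ttft_ms:.2f} ms")
print(f"Total cost:        ${total_cost:.2f}")
print(f"Avg request cost:  ${avg_request_cost:.4f}")
print(f"Peak day:          {peak_day[0]} ({peak_day[1]} requests)")
```

The same weighting does not reproduce the reported average TPS of 436.24 (it gives roughly 489), so that figure appears to be computed some other way.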

Performance Summary:

Model: glm-4.5 by zai

Monitoring Period: 9/4/2025 to 9/11/2025

Average Daily Requests: 44

Peak Daily Requests: 97

Performance Trends

Sep 4 - Sep 11, 2025

Request Volume (daily API requests): 350 total
Performance (TPS, tokens per second): 436.24 tokens/s
Response Time (average response latency): 14574.60 ms
TTFT (time to first token): 3898.80 ms

Token Analytics

Token usage distribution and efficiency metrics

Token Distribution

Input vs Output token usage

Input Tokens: 2,051,449
Output Tokens: 173,867
Total Tokens: 2,225,316
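The token totals can be broken down per request and related to the throughput figure. The page does not define how TPS is computed; dividing all tokens (input plus output) by the total response time happens to reproduce the reported 436.24, so the sketch below uses that as a working assumption rather than a documented formula. All variable names are illustrative.

```python
# Totals copied from this page.
input_tokens = 2_051_449
output_tokens = 173_867
total_requests = 350
avg_response_ms = 14574.60

total_tokens = input_tokens + output_tokens
output_share_pct = 100 * output_tokens / total_tokens
avg_input_per_request = input_tokens / total_requests
avg_output_per_request = output_tokens / total_requests

# Assumption: TPS = (input + output tokens) / total response time in seconds.
total_response_s = total_requests * avg_response_ms / 1000
implied_tps = total_tokens / total_response_s

print(f"Total tokens:           {total_tokens:,}")
print(f"Output share:           {output_share_pct:.2f}%")
print(f"Avg input per request:  {avg_input_per_request:.0f}")
print(f"Avg output per request: {avg_output_per_request:.0f}")
print(f"Implied TPS:            {implied_tps:.2f}")
```

If that reading is right, the TPS figure is dominated by input tokens, which make up over 90% of the traffic, and should not be read as pure generation speed.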

Token Usage Timeline