glm-4.6v by openrouter - AI Model Details, Pricing, and Performance Metrics

zai

glm-4.6v

completions
byopenrouter

GLM-4.6V is a large multimodal model designed for high-fidelity visual understanding and long-context reasoning across images, documents, and mixed media. It supports up to 128K tokens, processes complex page layouts and charts directly as visual inputs, and integrates native multimodal function calling to connect perception with downstream tool execution. The model also enables interleaved image-text generation and UI reconstruction workflows, including screenshot-to-HTML synthesis and iterative visual editing.

Released
Dec 8, 2025
Knowledge
Jun 11, 2025
Context
131072
Input
$0.3 / 1M tokens
Output
$0.9 / 1M tokens
Cached
$0.15 / 1M tokens
Capabilities: tools, reasoning
Accepts: text, image
Returns: text

Access glm-4.6v through LangDB AI Gateway

Recommended

Integrate with zai's glm-4.6v and 250+ other models through a unified API. Monitor usage, control costs, and enhance security.

Unified API
Cost Optimization
Enterprise Security
Get Started Now

Free tier available • No credit card required

Instant Setup
99.9% Uptime
10,000+Monthly Requests

Category Scores

Benchmark Tests

View Other Benchmarks
HLE
8.9
General Knowledge
GPQA
71.9
STEM (Physics, Chemistry, Biology)
SciCode
30.4
Scientific
MMLU-Pro
79.9
General Knowledge
LiveCodeBench
16.0
Programming
AA Math Index
85.3
Mathematics
AA Coding Index
19.7
Programming
AAII
23.5
General

Code Examples

Integration samples and API usage