grok-4-fast by openrouter - AI Model Details, Pricing, and Performance Metrics

x-ai

grok-4-fast

completions
byopenrouter

Grok 4 Fast is xAI's latest multimodal model with SOTA cost-efficiency and a 2M token context window. It comes in two flavors: non-reasoning and reasoning. Read more about the model on xAI's [news post](http://x.ai/news/grok-4-fast). Reasoning can be enabled using the `reasoning` `enabled` parameter in the API. [Learn more in our docs](https://openrouter.ai/docs/use-cases/reasoning-tokens#controlling-reasoning-tokens) Prompts and completions on Grok 4 Fast Free may be used by xAI or OpenRouter to improve future models.

Released
Aug 28, 2025
Knowledge
Mar 1, 2025
License
Proprietary
Context
2M
Input
$0.2 / 1M tokens
Output
$0.5 / 1M tokens
Cached
$0.05 / 1M tokens
Capabilities: tools, reasoning
Accepts: text, image
Returns: text

Access grok-4-fast through LangDB AI Gateway

Recommended

Integrate with x-ai's grok-4-fast and 250+ other models through a unified API. Monitor usage, control costs, and enhance security.

Unified API
Cost Optimization
Enterprise Security
Get Started Now

Free tier available • No credit card required

Instant Setup
99.9% Uptime
10,000+Monthly Requests
Request Volume
Daily API requests
400
Performance (TPS)
Tokens per second
883.19 tokens/s

Category Scores

Benchmark Tests

View Other Benchmarks
HLE
17.0
General Knowledge
GPQA
84.7
STEM (Physics, Chemistry, Biology)
SciCode
44.2
Scientific
MMLU-Pro
85.0
General Knowledge
LiveCodeBench
83.2
Programming
AA Math Index
89.7
Mathematics
AA Coding Index
48.4
Programming
AAII
60.3
General

Code Examples

Integration samples and API usage