kimi-k2-thinking by openrouter - AI Model Details, Pricing, and Performance Metrics

moonshotai
kimi-k2-thinking
Try
moonshotai

kimi-k2-thinking

completions
byopenrouter

Kimi K2 Thinking is Moonshot AI’s most advanced open reasoning model to date, extending the K2 series into agentic, long-horizon reasoning. Built on the trillion-parameter Mixture-of-Experts (MoE) architecture introduced in Kimi K2, it activates 32 billion parameters per forward pass and supports 256 k-token context windows. The model is optimized for persistent step-by-step thought, dynamic tool invocation, and complex reasoning workflows that span hundreds of turns. It interleaves step-by-step reasoning with tool use, enabling autonomous research, coding, and writing that can persist for hundreds of sequential actions without drift. It sets new open-source benchmarks on HLE, BrowseComp, SWE-Multilingual, and LiveCodeBench, while maintaining stable multi-agent behavior through 200–300 tool calls. Built on a large-scale MoE architecture with MuonClip optimization, it combines strong reasoning depth with high inference efficiency for demanding agentic and analytical tasks.

Released
Nov 6, 2025
Knowledge
May 10, 2025
Context
262144
Input
$0.57 / 1M tokens
Output
$2.42 / 1M tokens
Cached
$0.15 / 1M tokens
Capabilities: tools, reasoning
Accepts: text
Returns: text

Access kimi-k2-thinking through LangDB AI Gateway

Recommended

Integrate with moonshotai's kimi-k2-thinking and 250+ other models through a unified API. Monitor usage, control costs, and enhance security.

Unified API
Cost Optimization
Enterprise Security
Get Started Now

Free tier available • No credit card required

Instant Setup
99.9% Uptime
10,000+Monthly Requests

Benchmark Tests

View Other Benchmarks
AA Coding Index
52.2
Programming
AAII
67.0
General
AA Math Index
94.7
Mathematics
GPQA
83.8
STEM (Physics, Chemistry, Biology)
HLE
22.3
General Knowledge
LiveCodeBench
85.3
Programming
MMLU-Pro
84.8
General Knowledge
SciCode
42.4
Scientific

Code Examples

Integration samples and API usage