kimi-k2-thinking

Model Type: completions

Published by: moonshotai

Provider: openrouter

Kimi K2 Thinking is Moonshot AI’s most advanced open reasoning model to date, extending the K2 series into agentic, long-horizon reasoning. Built on the trillion-parameter Mixture-of-Experts (MoE) architecture introduced in Kimi K2, with MuonClip optimization, it activates 32 billion parameters per forward pass and supports a 256K-token context window. The model interleaves step-by-step reasoning with dynamic tool invocation, enabling autonomous research, coding, and writing workflows that persist for hundreds of sequential actions without drift. It sets new open-source records on HLE, BrowseComp, SWE-Multilingual, and LiveCodeBench while maintaining stable multi-agent behavior through 200–300 tool calls, combining strong reasoning depth with high inference efficiency for demanding agentic and analytical tasks.

Released: Nov 6, 2025

Knowledge Cutoff: May 10, 2025

Context: 262144 tokens

Input: $0.57 / 1M tokens

Output: $2.42 / 1M tokens

Cached: $0.15 / 1M tokens

Capabilities: tools, reasoning

Accepts: text

Returns: text
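The per-token prices listed above can be turned into a rough per-request estimate. The sketch below is illustrative only: the function name `estimate_cost` is hypothetical, and it assumes that cached tokens are a subset of the input tokens and are billed at the cached rate instead of the input rate, which the listing does not state.

```python
# Prices in USD per 1M tokens, copied from the listing above.
PRICE_PER_MILLION = {"input": 0.57, "output": 2.42, "cached": 0.15}


def estimate_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> float:
    """Estimate the USD cost of one request.

    Assumption (not stated in the listing): cached tokens are part of the
    input and are charged at the cached rate rather than the input rate.
    """
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")
    fresh_tokens = input_tokens - cached_tokens
    total = (
        fresh_tokens * PRICE_PER_MILLION["input"]
        + cached_tokens * PRICE_PER_MILLION["cached"]
        + output_tokens * PRICE_PER_MILLION["output"]
    )
    return total / 1_000_000


# Example: 100k input tokens (half of them cached) and 20k output tokens.
print(f"${estimate_cost(100_000, 20_000, cached_tokens=50_000):.4f}")
```

Actual billing may differ from this estimate, for example if reasoning tokens are counted separately or the gateway adds its own fees.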

Access kimi-k2-thinking through LangDB AI Gateway

Integrate with moonshotai's kimi-k2-thinking and 250+ other models through a unified API. Monitor usage, control costs, and enhance security.

Unified API

Cost Optimization

Enterprise Security

Benchmark Results for kimi-k2-thinking

Category Performance Scores:

Academia: Score 75.40 (Top 2% - Rank #8)
Finance: Score 80.85 (Top 2% - Rank #8)
Marketing: Score 72.41 (Top 3% - Rank #11)
Maths: Score 94.70 (Top 0% - Rank #0)
Programming: Score 52.20 (Top 2% - Rank #8)
Science: Score 77.36 (Top 7% - Rank #25)
Writing: Score 69.60 (Top 7% - Rank #25)

Overall Performance: 74.65 average score across all categories
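As a quick check on the overall figure, the seven category scores listed above can be averaged directly. This is a minimal sketch that assumes the overall score is a plain unweighted mean, which the page does not confirm.

```python
from statistics import mean

# Category scores copied from the list above.
category_scores = {
    "Academia": 75.40,
    "Finance": 80.85,
    "Marketing": 72.41,
    "Maths": 94.70,
    "Programming": 52.20,
    "Science": 77.36,
    "Writing": 69.60,
}

# Unweighted mean across categories (an assumption about how the page aggregates).
average = mean(category_scores.values())
print(f"Average across {len(category_scores)} categories: {average:.2f}")
```

The result agrees with the reported overall score to two decimal places. Any difference in later digits may come from the page averaging values that are not shown in rounded form here.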

Detailed Benchmark Scores:

Benchmark	Score	Percentile	Domain
AA Coding Index	52.20	Top 2%	Programming
AAII	67.00	Top 1%	General
AA Math Index	94.70	Top 0%	Mathematics
GPQA	83.80	Top 4%	STEM (Physics, Chemistry, Biology)
HLE	22.30	Top 2%	General Knowledge
LiveCodeBench	85.30	Top 4%	Programming
MMLU-Pro	84.80	Top 8%	General Knowledge
SciCode	42.40	Top 4%	Scientific

GPQA Score: 83.80 - Graduate-level reasoning benchmark

Model Comparison:

Provider: openrouter

Model Type: completions

Context Size: 262144 tokens

Comparing against 354 models in the database
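The "Top N%" labels in the category list appear consistent with dividing each rank by the 354 models in the comparison set and rounding to a whole percent. The sketch below illustrates that reading; the helper `top_percent` is hypothetical, and the formula is inferred from the listed numbers, not documented by the page.

```python
TOTAL_MODELS = 354  # "Comparing against 354 models in the database"


def top_percent(rank: int, total: int = TOTAL_MODELS) -> int:
    """Rank as a share of all compared models, rounded to a whole percent.

    Inferred formula, not documented: round(rank / total * 100).
    """
    return round(rank / total * 100)


# Ranks copied from the category list above.
category_ranks = {
    "Academia": 8,
    "Finance": 8,
    "Marketing": 11,
    "Maths": 0,
    "Programming": 8,
    "Science": 25,
    "Writing": 25,
}

for category, rank in category_ranks.items():
    print(f"{category}: Top {top_percent(rank)}% - Rank #{rank}")
```

Under this reading, the "Top 0% - Rank #0" entry for Maths would follow from a zero-based rank for the leading model, though the page does not say so.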

Category Scores

academia #4, finance #3, marketing #6, maths #1, programming #5

Benchmark Tests

Metric	AA Coding Index	AAII	AA Math Index	GPQA	HLE	LiveCodeBench	MMLU-Pro	SciCode
Score	52.2	67.0	94.7	83.8	22.3	85.3	84.8	42.4

Compare with Similar Models

gemini-2.5-pro-preview

Code Examples

Integration samples and API usage

Code Samples for kimi-k2-thinking

Python SDK Example:

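from openai import OpenAI

# Point the OpenAI SDK at the LangDB gateway. The SDK appends
# /chat/completions to base_url, so the /v1 suffix here mirrors the
# path used in the cURL example.
client = OpenAI(
    base_url="https://api.us-east-1.langdb.ai/projects/<your_project_id>/v1",
    api_key="<your_api_key>"
)

# Send a single-turn chat request to the model.
response = client.chat.completions.create(
    model="kimi-k2-thinking",
    messages=[
        {"role": "user", "content": "Hello, how are you?"}
    ]
)

# Print the text of the first returned choice.
print(response.choices[0].message.content)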

TypeScript SDK Example:

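import OpenAI from 'openai';

// Point the OpenAI SDK at the LangDB gateway. The SDK appends
// /chat/completions to baseURL, so the /v1 suffix here mirrors the
// path used in the cURL example.
const client = new OpenAI({
    baseURL: "https://api.us-east-1.langdb.ai/projects/<your_project_id>/v1",
    apiKey: "<your_api_key>"
});

// Send a single-turn chat request to the model.
const response = await client.chat.completions.create({
    model: "kimi-k2-thinking",
    messages: [
        { role: "user", content: "Hello, how are you?" }
    ]
});

// Print the text of the first returned choice.
console.log(response.choices[0].message.content);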

cURL Example:

curl -X POST "https://api.us-east-1.langdb.ai/projects/<your_project_id>/v1/chat/completions" \
  -H "Authorization: Bearer <your_api_key>" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "kimi-k2-thinking",
    "messages": [
        {"role": "user", "content": "Hello, how are you?"}
    ]
  }'
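Since the listing names tools as a capability, a tool-calling round trip might look like the sketch below. It uses the `tools` parameter of the OpenAI-compatible chat completions API. The `get_weather` tool, the `dispatch_tool_call` helper, and the `ask_with_tools` function are hypothetical names introduced for illustration, and whether the gateway forwards tool definitions unchanged to this model is an assumption.

```python
import json


# Hypothetical local tool, for illustration only.
def get_weather(city: str) -> str:
    return f"Sunny in {city}"


# Tool schema in the OpenAI chat completions "tools" format.
TOOLS = [{
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Return a short weather summary for a city.",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}]

LOCAL_TOOLS = {"get_weather": get_weather}


def dispatch_tool_call(name: str, arguments_json: str) -> str:
    """Run the named local tool with the JSON-encoded arguments from the model."""
    arguments = json.loads(arguments_json)
    return LOCAL_TOOLS[name](**arguments)


def ask_with_tools(client, question: str) -> str:
    """One tool round trip. `client` is an openai.OpenAI instance configured
    for the gateway, as in the Python SDK example."""
    messages = [{"role": "user", "content": question}]
    first = client.chat.completions.create(
        model="kimi-k2-thinking", messages=messages, tools=TOOLS
    )
    reply = first.choices[0].message
    if not reply.tool_calls:
        return reply.content  # The model answered without calling a tool.
    messages.append(reply)
    for call in reply.tool_calls:
        result = dispatch_tool_call(call.function.name, call.function.arguments)
        messages.append({"role": "tool", "tool_call_id": call.id, "content": result})
    final = client.chat.completions.create(
        model="kimi-k2-thinking", messages=messages, tools=TOOLS
    )
    return final.choices[0].message.content
```

Calling `ask_with_tools(client, "What is the weather in Paris?")` would send the question, execute any requested tool locally, and return the model's final answer. A long-running agent would repeat the dispatch step in a loop until the model stops requesting tools.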

Model: kimi-k2-thinking

Provider: openrouter

API Endpoint: https://api.us-east-1.langdb.ai/

Related Models

Similar models from openrouter

afm-4.5b

aion-1.0

aion-1.0-mini

aion-rp-llama-3.1-8b

coder-large

cogito-v2-preview-deepseek-671b
