kimi-k2 by deepinfra - AI Model Details, Pricing, and Performance Metrics

moonshotai
kimi-k2
moonshotai

kimi-k2

completions
On:deepinfraparasail

Kimi K2 Instruct is a large-scale Mixture-of-Experts (MoE) language model developed by Moonshot AI, featuring 1 trillion total parameters with 32 billion active per forward pass. It is optimized for agentic capabilities, including advanced tool use, reasoning, and code synthesis. Kimi K2 excels across a broad range of benchmarks, particularly in coding (LiveCodeBench, SWE-bench), reasoning (ZebraLogic, GPQA), and tool-use (Tau2, AceBench) tasks. It supports long-context inference up to 128K tokens and is designed with a novel training stack that includes the MuonClip optimizer for stable large-scale MoE training.

ProviderInputOutput
deepinfra
deepinfra
$0.5 / 1M tokens$2 / 1M tokens
parasail
parasail
$0.99 / 1M tokens$2.99 / 1M tokens
Released
Jan 1, 2025
Knowledge
Jul 5, 2024
Context
131K
Input
$0.5 / 1M tokens
Output
$2 / 1M tokens
Capabilities: tools
Accepts: text
Returns: text

Access kimi-k2 through LangDB AI Gateway

Recommended

Integrate with moonshotai's kimi-k2 and 250+ other models through a unified API. Monitor usage, control costs, and enhance security.

Unified API
Cost Optimization
Enterprise Security
Get Started Now

Free tier available • No credit card required

Instant Setup
99.9% Uptime
10,000+Monthly Requests
Available from 2 providers
Provider:
Request Volume
Daily API requests
9
Performance (TPS)
Tokens per second
1415.63 tokens/s

Category Scores

Benchmark Tests

View Other Benchmarks
AAII
50.4
General
AA Math Index
57.3
Mathematics
GPQA
76.3
STEM (Physics, Chemistry, Biology)
HLE
6.3
General Knowledge
HumanEval
94.5
Programming
LiveCodeBench
61.0
Programming
MMLU
90.2
General Knowledge
MMLU-Pro
82.2
General Knowledge
SciCode
30.7
Scientific

Code Examples

Integration samples and API usage