Model Comparison

kimi-k2 vs claude-sonnet-4

Compare performance, benchmarks, and characteristics

kimi-k2

groq

Context: 131072 tokens

Input Price: $1 / 1M tokens

Output Price: $3 / 1M tokens

claude-sonnet-4

anthropic

Context: 200K tokens

Input Price: $3 / 1M tokens

Output Price: $15 / 1M tokens

Metric	kimi-k2 (groq)	claude-sonnet-4 (anthropic)
Pricing
Input Price	$1 / 1M tokens	$3 / 1M tokens
Output Price	$3 / 1M tokens	$15 / 1M tokens
Capabilities
Context Window	131072 tokens	200K tokens
Capabilities	tools	tools
Input type	text	text, image
Category Scores
Overall Average	56.7	64.9
Academia	63.3	67.1
Finance	53.8	65.4
Marketing	57.6	66.4
Maths	57.3	74.3
Programming	38.1	45.1
Science	70.8	72.7
Writing	56.1	63.5
Benchmark Tests
AIME	_	77.3
AA Coding Index	38.1	45.1
AAII	50.4	56.5
AA Math Index	57.3	74.3
GPQA	76.3	77.7
HLE	6.3	9.6
HumanEval	94.5	_
LiveCodeBench	61.0	65.5
MATH-500	_	99.1
MMLU	90.2	_
MMLU-Pro	82.2	84.2
SciCode	30.7	40.0
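
The page does not say how Overall Average is derived. The figures are consistent with an unweighted mean of the seven category scores, which the minimal sketch below checks. The dictionary and function names are illustrative, not part of any published API, and the equal weighting is an assumption.

```python
# Category scores copied from the table above.
CATEGORY_SCORES = {
    "kimi-k2": {
        "Academia": 63.3, "Finance": 53.8, "Marketing": 57.6, "Maths": 57.3,
        "Programming": 38.1, "Science": 70.8, "Writing": 56.1,
    },
    "claude-sonnet-4": {
        "Academia": 67.1, "Finance": 65.4, "Marketing": 66.4, "Maths": 74.3,
        "Programming": 45.1, "Science": 72.7, "Writing": 63.5,
    },
}


def overall_average(model: str) -> float:
    """Unweighted mean of the category scores, rounded to one decimal.

    Assumption: every category carries equal weight.
    """
    scores = CATEGORY_SCORES[model].values()
    return round(sum(scores) / len(scores), 1)


if __name__ == "__main__":
    for name in CATEGORY_SCORES:
        print(name, overall_average(name))
```

Under that assumption the results match the reported 56.7 for kimi-k2 and 64.9 for claude-sonnet-4.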

kimi-k2 Advantages

• Lower input token cost ($1 / 1M tokens vs $3 / 1M tokens)
• Lower output token cost ($3 / 1M tokens vs $15 / 1M tokens)
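
To see what the price gap means for a given workload, the listed per-million-token prices can be applied to token counts. This is a rough sketch: the function name and the example token counts are hypothetical, and it ignores any caching, batching, or tiered discounts a provider may apply.

```python
# Prices in USD per 1M tokens, copied from the pricing rows of this page.
PRICES = {
    "kimi-k2": {"input": 1.0, "output": 3.0},
    "claude-sonnet-4": {"input": 3.0, "output": 15.0},
}


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for the given token counts."""
    price = PRICES[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical workload: 2M input tokens and 500K output tokens.
    for name in PRICES:
        print(name, request_cost(name, 2_000_000, 500_000))
```

For that hypothetical workload the estimate is $3.50 on kimi-k2 and $13.50 on claude-sonnet-4.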

claude-sonnet-4 Advantages

• Higher overall average score (64.9 vs 56.7)
• Larger context window (200K tokens vs 131072 tokens)