kimi-linear-48b-a3b-instruct

completions

byopenrouter

kimi-linear-48b-a3b-instruct

completions

Published by: moonshotaiProvider:

openrouter

Kimi Linear is a hybrid linear attention architecture that outperforms traditional full attention methods across various contexts, including short, long, and reinforcement learning (RL) scaling regimes. At its core is Kimi Delta Attention (KDA)—a refined version of Gated DeltaNet that introduces a more efficient gating mechanism to optimize the use of finite-state RNN memory. Kimi Linear achieves superior performance and hardware efficiency, especially for long-context tasks. It reduces the need for large KV caches by up to 75% and boosts decoding throughput by up to 6x for contexts as long as 1M tokens.

Released

Oct 30, 2025

Knowledge

May 3, 2025

Context

1048576

Input

$0.3 / 1M tokens

Output

$0.6 / 1M tokens

Accepts: text

Returns: text

Released Oct 30, 2025Knowledge Cutoff: May 3, 2025

Context: 1048576 Input: $0.3 / 1M tokensOutput: $0.6 / 1M tokensAccepts: textReturns: text

Access kimi-linear-48b-a3b-instruct through LangDB AI Gateway

Recommended

Integrate with moonshotai's kimi-linear-48b-a3b-instruct and 250+ other models through a unified API. Monitor usage, control costs, and enhance security.

Unified API

Cost Optimization

Enterprise Security

Get Started Now

Free tier available • No credit card required

Instant Setup

99.9% Uptime

10,000+Monthly Requests

Benchmark Results for kimi-linear-48b-a3b-instruct

Category Performance Scores:

Maths: Score 36.30 (Top 59% - Rank #190)
Finance: Score 31.20 (Top 62% - Rank #200)
Science: Score 43.65 (Top 85% - Rank #274)
Writing: Score 34.52 (Top 87% - Rank #281)
Academia: Score 33.65 (Top 75% - Rank #242)
Marketing: Score 35.44 (Top 81% - Rank #261)
Programming: Score 22.80 (Top 71% - Rank #229)

Overall Performance: 33.93714285714286 average score across all categories

Detailed Benchmark Scores:

Benchmark	Score	Percentile	Domain
HLE	2.70	Top 99%	General Knowledge
GPQA	41.20	Top 81%	STEM (Physics, Chemistry, Biology)
SciCode	19.90	Top 81%	Scientific
MMLU-Pro	58.50	Top 84%	General Knowledge
LiveCodeBench	37.80	Top 52%	Programming
AA Math Index	36.30	Top 59%	Mathematics
AA Coding Index	22.80	Top 71%	Programming
AAII	26.10	Top 71%	General

GPQA Score: 41.20 - Graduate-level reasoning benchmark

Model Comparison:

Provider: openrouter

Model Type: completions

Context Size: 1048576 tokens

Comparing against 322 models in the database

Category Scores

Benchmark Tests

View Other Benchmarks

HLE

2.7

General Knowledge

GPQA

41.2

STEM (Physics, Chemistry, Biology)

SciCode

19.9

Scientific

MMLU-Pro

58.5

General Knowledge

LiveCodeBench

37.8

Programming

AA Math Index

36.3

Mathematics

AA Coding Index

22.8

Programming

AAII

26.1

General

Metric	HLE	GPQA	SciCode	MMLU-Pro	LiveCodeBench	AA Math Index	AA Coding Index	AAII
Score	2.7	41.2	19.9	58.5	37.8	36.3	22.8	26.1

Compare with Similar Models

claude-opus-4.5

gemini-3-flash-preview

Code Examples

Integration samples and API usage

Code Samples for kimi-linear-48b-a3b-instruct

Python SDK Example:

from openai import OpenAI

client = OpenAI(
    base_url="https://api.us-east-1.langdb.ai/projects/<your_project_id>",
    api_key="<your_api_key>"
)

response = client.chat.completions.create(
    model="kimi-linear-48b-a3b-instruct",
    messages=[
        {"role": "user", "content": "Hello, how are you?"}
    ]
)

print(response.choices[0].message.content)

TypeScript SDK Example:

import OpenAI from 'openai';

const client = new OpenAI({
    baseURL: "https://api.us-east-1.langdb.ai/projects/<your_project_id>",
    apiKey: "<your_api_key>"
});

const response = await client.chat.completions.create({
    model: "kimi-linear-48b-a3b-instruct",
    messages: [
        { role: "user", content: "Hello, how are you?" }
    ]
});

console.log(response.choices[0].message.content);

cURL Example:

curl -X POST "https://api.us-east-1.langdb.ai/projects/<your_project_id>/v1/chat/completions" \
  -H "Authorization: Bearer <your_api_key>" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "kimi-linear-48b-a3b-instruct",
    "messages": [
        {"role": "user", "content": "Hello, how are you?"}
    ]
  }'

Model: kimi-linear-48b-a3b-instruct

Provider: openrouter

API Endpoint: $https://api.us-east-1.langdb.ai

Create API Key

Related Models

Similar models from openrouter

kimi-linear-48b-a3b-instruct

kimi-linear-48b-a3b-instruct

Access kimi-linear-48b-a3b-instruct through LangDB AI Gateway

Category Scores

Benchmark Tests

Compare with Similar Models

Code Examples

Related Models

afm-4.5b

aion-1.0

aion-1.0-mini

aion-rp-llama-3.1-8b

coder-large

cogito-v2-preview-deepseek-671b

kimi-linear-48b-a3b-instruct by openrouter - AI Model Details, Pricing, and Performance Metrics

kimi-linear-48b-a3b-instruct

kimi-linear-48b-a3b-instruct

Access kimi-linear-48b-a3b-instruct through LangDB AI Gateway

Category Scores

Benchmark Tests

Compare with Similar Models

Code Examples

Related Models

afm-4.5b

aion-1.0

aion-1.0-mini

aion-rp-llama-3.1-8b

coder-large

cogito-v2-preview-deepseek-671b