deepseek-chat-v3.1 by deepinfra - AI Model Details, Pricing, and Performance Metrics

deepseek / deepseek-chat-v3.1

completions · by deepinfra

DeepSeek-V3.1 is a large hybrid reasoning model (671B parameters, 37B active) that supports both thinking and non-thinking modes via prompt templates. It extends the DeepSeek-V3 base with a two-phase long-context training process, reaching up to 128K tokens, and uses FP8 microscaling for efficient inference. Users can control the reasoning behaviour with the `enabled` flag of the `reasoning` parameter. [Learn more in our docs](https://openrouter.ai/docs/use-cases/reasoning-tokens#enable-reasoning-with-default-config). The model improves tool use, code generation, and reasoning efficiency, achieving performance comparable to DeepSeek-R1 on difficult benchmarks while responding more quickly. It supports structured tool calling, code agents, and search agents, making it suitable for research, coding, and agentic workflows. It succeeds the [DeepSeek V3-0324](/deepseek/deepseek-chat-v3-0324) model and performs well on a variety of tasks.
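A minimal sketch of toggling the thinking mode through the `reasoning` field, assuming an OpenRouter-style chat completions endpoint and an API key stored in the `OPENROUTER_API_KEY` environment variable:

```python
import os
import requests

# Sketch only: the endpoint and payload shape assume an OpenRouter-style
# chat completions API; adjust for your gateway of choice.
response = requests.post(
    "https://openrouter.ai/api/v1/chat/completions",
    headers={"Authorization": f"Bearer {os.environ['OPENROUTER_API_KEY']}"},
    json={
        "model": "deepseek/deepseek-chat-v3.1",
        "messages": [{"role": "user", "content": "Prove that sqrt(2) is irrational."}],
        "reasoning": {"enabled": True},  # set False for the faster non-thinking mode
    },
    timeout=120,
)
response.raise_for_status()
print(response.json()["choices"][0]["message"]["content"])
```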

Released: Aug 21, 2025
Knowledge cutoff: Feb 22, 2025
Context: 163,840 tokens
Input: $0.30 / 1M tokens
Output: $1.00 / 1M tokens
Capabilities: tools
Accepts: text
Returns: text
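
As an illustration of the listed rates, a small helper (hypothetical, for back-of-the-envelope estimates only) that converts token counts into dollar cost at $0.30 per 1M input tokens and $1.00 per 1M output tokens:

```python
def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Back-of-the-envelope cost estimate using the listed deepinfra rates."""
    INPUT_PRICE_PER_M = 0.30   # USD per 1M input tokens
    OUTPUT_PRICE_PER_M = 1.00  # USD per 1M output tokens
    return (input_tokens / 1_000_000) * INPUT_PRICE_PER_M + \
           (output_tokens / 1_000_000) * OUTPUT_PRICE_PER_M

# e.g. a 120K-token prompt with a 2K-token answer costs roughly $0.038
print(f"${estimate_cost(120_000, 2_000):.4f}")
```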

Access deepseek-chat-v3.1 through LangDB AI Gateway

Recommended

Integrate with deepseek's deepseek-chat-v3.1 and 250+ other models through a unified API. Monitor usage, control costs, and enhance security.

Unified API
Cost Optimization
Enterprise Security
Get Started Now

Free tier available • No credit card required

Instant Setup
99.9% Uptime
10,000+ Monthly Requests
Request Volume (daily API requests): 154
Performance: 1276.95 tokens/s

Category Scores

Benchmark Tests

| Benchmark | Score | Category |
| --- | --- | --- |
| AA Coding Index | 47.2 | Programming |
| AAII | 44.8 | General |
| GPQA | 73.5 | STEM (Physics, Chemistry, Biology) |
| HLE | 6.3 | General Knowledge |
| LiveCodeBench | 57.7 | Programming |
| MMLU-Pro | 83.3 | General Knowledge |
| SciCode | 36.7 | Scientific |

Code Examples

Integration samples and API usage
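
A minimal integration sketch using the OpenAI Python SDK against an OpenAI-compatible gateway. The base URL, environment variable names, model identifier, and the `search_docs` tool are placeholders chosen for illustration; substitute the values your provider (LangDB, deepinfra, OpenRouter, etc.) documents. The tool definition exercises the model's structured tool calling mentioned above.

```python
import json
import os

from openai import OpenAI  # assumes the `openai` Python SDK is installed

# Placeholder base URL and API key: point these at your OpenAI-compatible gateway.
client = OpenAI(
    base_url=os.environ.get("GATEWAY_BASE_URL", "https://api.deepinfra.com/v1/openai"),
    api_key=os.environ["GATEWAY_API_KEY"],
)

# A single tool definition to exercise structured tool calling.
tools = [{
    "type": "function",
    "function": {
        "name": "search_docs",  # hypothetical tool name for illustration
        "description": "Search internal documentation and return matching snippets.",
        "parameters": {
            "type": "object",
            "properties": {"query": {"type": "string"}},
            "required": ["query"],
        },
    },
}]

response = client.chat.completions.create(
    model="deepseek-ai/DeepSeek-V3.1",  # model identifier varies by provider
    messages=[{"role": "user", "content": "Find our retry policy for failed webhooks."}],
    tools=tools,
)

message = response.choices[0].message
if message.tool_calls:
    # The model chose to call the tool; arguments arrive as a JSON string.
    call = message.tool_calls[0]
    print(call.function.name, json.loads(call.function.arguments))
else:
    print(message.content)
```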