deepseek-v3.1-base by openrouter - AI Model Details, Pricing, and Performance Metrics
deepseek-v3.1-base
completionsdeepseek-v3.1-base
This is a base model, trained only for raw next-token prediction. Unlike instruct/chat models, it has not been fine-tuned to follow user instructions. Prompts need to be written more like training text or examples rather than simple requests (e.g., “Translate the following sentence…” instead of just “Translate this”). DeepSeek-V3.1 Base is a 671B parameter open Mixture-of-Experts (MoE) language model with 37B active parameters per forward pass and a context length of 128K tokens. Trained on 14.8T tokens using FP8 mixed precision, it achieves high training efficiency and stability, with strong performance across language, reasoning, math, and coding tasks.
This is a base model, trained only for raw next-token prediction. Unlike instruct/chat models, it has not been fine-tuned to follow user instructions. Prompts need to be written more like training text or examples rather than simple requests (e.g., “Translate the following sentence…” instead of just “Translate this”). DeepSeek-V3.1 Base is a 671B parameter open Mixture-of-Experts (MoE) language model with 37B active parameters per forward pass and a context length of 128K tokens. Trained on 14.8T tokens using FP8 mixed precision, it achieves high training efficiency and stability, with strong performance across language, reasoning, math, and coding tasks.
Access deepseek-v3.1-base through LangDB AI Gateway
Integrate with deepseek's deepseek-v3.1-base and 250+ other models through a unified API. Monitor usage, control costs, and enhance security.
Free tier available • No credit card required
Statistics
Category Scores
Benchmark Tests
Metric | AIME | AA Coding Index | AAII | AA Math Index | DROP | GPQA | HLE | LiveCodeBench | MATH-500 | MMLU | MMLU-Pro | SciCode |
---|---|---|---|---|---|---|---|---|---|---|---|---|
Score | 25.3 | 35.6 | 32.5 | 26.0 | 91.6 | 57.4 | 3.6 | 35.9 | 88.7 | 88.5 | 75.6 | 35.4 |
Compare with Similar Models
Code Examples
Integration samples and API usage
Related Models
Similar models from openrouter