deepseek-r1-0528-qwen3-8b by parasail - AI Model Details, Pricing, and Performance Metrics

deepseek
deepseek-r1-0528-qwen3-8b
Try
deepseek

deepseek-r1-0528-qwen3-8b

completions
byparasail

DeepSeek-R1-0528 is a lightly upgraded release of DeepSeek R1 that taps more compute and smarter post-training tricks, pushing its reasoning and inference to the brink of flagship models like O3 and Gemini 2.5 Pro. It now tops math, programming, and logic leaderboards, showcasing a step-change in depth-of-thought. The distilled variant, DeepSeek-R1-0528-Qwen3-8B, transfers this chain-of-thought into an 8 B-parameter form, beating standard Qwen3 8B by +10 pp and tying the 235 B “thinking” giant on AIME 2024.

Released
Jan 20, 2025
Knowledge
Jul 24, 2024
License
MIT
Context
131072
Input
$0.05 / 1M tokens
Output
$0.1 / 1M tokens
Accepts: text
Returns: text

Access deepseek-r1-0528-qwen3-8b through LangDB AI Gateway

Recommended

Integrate with deepseek's deepseek-r1-0528-qwen3-8b and 250+ other models through a unified API. Monitor usage, control costs, and enhance security.

Unified API
Cost Optimization
Enterprise Security
Get Started Now

Free tier available • No credit card required

Instant Setup
99.9% Uptime
10,000+Monthly Requests

Category Scores

Benchmark Tests

View Other Benchmarks
HLE
14.9
General Knowledge
AIME
89.3
Mathematics
GPQA
81.3
STEM (Physics, Chemistry, Biology)
SciCode
40.3
Scientific
MATH-500
98.3
Mathematics
MMLU-Pro
84.9
General Knowledge
LiveCodeBench
77.0
Programming
AA Math Index
76.0
Mathematics
AA Coding Index
44.1
Programming
AAII
52.0
General

Code Examples

Integration samples and API usage