deepseek-r1-0528-qwen3-8b by parasail - AI Model Details, Pricing, and Performance Metrics

deepseek
deepseek-r1-0528-qwen3-8b
deepseek

deepseek-r1-0528-qwen3-8b

completions
byparasail

DeepSeek-R1-0528 is a lightly upgraded release of DeepSeek R1 that taps more compute and smarter post-training tricks, pushing its reasoning and inference to the brink of flagship models like O3 and Gemini 2.5 Pro. It now tops math, programming, and logic leaderboards, showcasing a step-change in depth-of-thought. The distilled variant, DeepSeek-R1-0528-Qwen3-8B, transfers this chain-of-thought into an 8 B-parameter form, beating standard Qwen3 8B by +10 pp and tying the 235 B “thinking” giant on AIME 2024.

Released
Jan 20, 2025
Knowledge
Jul 24, 2024
License
MIT
Context
131072
Input
$0.05 / 1M tokens
Output
$0.1 / 1M tokens
Accepts: text
Returns: text

Access deepseek-r1-0528-qwen3-8b through LangDB AI Gateway

Recommended

Integrate with deepseek's deepseek-r1-0528-qwen3-8b and 250+ other models through a unified API. Monitor usage, control costs, and enhance security.

Unified API
Cost Optimization
Enterprise Security
Get Started Now

Free tier available • No credit card required

Instant Setup
99.9% Uptime
10,000+Monthly Requests
Request Volume
Daily API requests
2
Performance (TPS)
Tokens per second
53.85 tokens/s

Category Scores

Benchmark Tests

View Other Benchmarks
AIME
89.3
Mathematics
AA Coding Index
58.7
Programming
AAII
52.0
General
AA Math Index
76.0
Mathematics
GPQA
81.3
STEM (Physics, Chemistry, Biology)
HLE
14.9
General Knowledge
LiveCodeBench
77.0
Programming
MATH-500
98.3
Mathematics
MMLU-Pro
84.9
General Knowledge
SciCode
40.3
Scientific

Code Examples

Integration samples and API usage