gpt-oss-20b by deepinfra - AI Model Details, Pricing, and Performance Metrics

openai

gpt-oss-20b

completions
bydeepinfra

gpt-oss-20b is an open-weight 21B parameter model released by OpenAI under the Apache 2.0 license. It uses a Mixture-of-Experts (MoE) architecture with 3.6B active parameters per forward pass, optimized for lower-latency inference and deployability on consumer or single-GPU hardware. The model is trained in OpenAI’s Harmony response format and supports reasoning level configuration, fine-tuning, and agentic capabilities including function calling, tool use, and structured outputs.

Released
Aug 5, 2025
Knowledge
Feb 6, 2025
License
Apache-2.0
Context
131072
Input
$0.04 / 1M tokens
Output
$0.16 / 1M tokens
Capabilities: tools
Accepts: text
Returns: text

Access gpt-oss-20b through LangDB AI Gateway

Recommended

Integrate with openai's gpt-oss-20b and 250+ other models through a unified API. Monitor usage, control costs, and enhance security.

Unified API
Cost Optimization
Enterprise Security
Get Started Now

Free tier available • No credit card required

Instant Setup
99.9% Uptime
10,000+Monthly Requests
Request Volume
Daily API requests
7
Performance (TPS)
Tokens per second
30.09 tokens/s

Category Scores

Benchmark Tests

View Other Benchmarks
HLE
9.8
General Knowledge
GPQA
70.2
STEM (Physics, Chemistry, Biology)
MMLU
85.3
General Knowledge
SciCode
34.4
Scientific
MMLU-Pro
74.8
General Knowledge
LiveCodeBench
77.7
Programming
AA Math Index
89.3
Mathematics
AA Coding Index
40.7
Programming
AAII
52.4
General

Code Examples

Integration samples and API usage