grok-3-mini-beta

completions

Grok 3 Mini is a lightweight, smaller thinking model. Unlike traditional models that generate answers immediately, Grok 3 Mini thinks before responding. It’s ideal for reasoning-heavy tasks that don’t demand extensive domain knowledge, and shines in math-specific and quantitative use cases, such as solving challenging puzzles or math problems. Transparent "thinking" traces accessible. Defaults to low reasoning, can boost with setting `reasoning: { effort: "high" }` Note: That there are two xAI endpoints for this model. By default when using this model we will always route you to the base endpoint. If you want the fast endpoint you can add `provider: { sort: throughput}`, to sort by throughput instead.

Input:$0.3 / 1M tokens•$0.07 / 1M tokenscached

Output:$0.5 / 1M tokens

Context:131072 tokens

text

Category Rankings

science#6

Access grok-3-mini-beta through LangDB AI Gateway

Recommended

Integrate with xai's grok-3-mini-beta and 250+ other models through a unified API. Monitor usage, control costs, and enhance security.

Unified API

Cost Optimization

Enterprise Security

Get Started Now

Free tier available • No credit card required

Instant Setup

99.9% Uptime

10,000+Monthly Requests

Code Example

Configuration

Base URL

API Keys

Headers

Project ID in header

X-Run-Id

X-Thread-Id

Model Parameters

11 available

include_reasoning

logprobs

max_tokens

response_format

seed

stop

temperature

012

tool_choice

tools

top_logprobs

top_p

011

Additional Configuration

Tools

Guards

User:

Id:

Name:

Tags:

Publicly Shared Threads5

Integer solutions to 3x + 5y = 100
2 months ago
Integer solutions to 3x+5y=100 are all pairs (x,y) = (30+5t, 2-3t) for any integer t; (30,2) is a particular solution.
diophantine equation
integer solutions
linear equations
solution method
Responsive 2-Column Photo Gallery with Hover Effect
2 months ago
Complete HTML/CSS code for a responsive photo gallery showing 1 column on mobile and 2 columns ≥600px, with a hover scale and box-shadow effect.
responsive photo gallery
css grid layout
image hover effect
two column gallery
Synonyms and nuances of "gregarious"
2 months ago
Three synonyms for "gregarious"—extroverted, convivial, affable—differ by focusing on personality energy, festive sociability, and friendly approachability, respectively.
synonyms for gregarious
nuances of sociable adjectives
definitions and examples of gregarious synonyms
subtle differences in adjective usage
Occurrences and overlaps of "issi" in the sentence
2 months ago
Analysis of “issi” in the sentence shows 2 occurrences starting at indices 1 and 4 (overlapping counted) within “Mississippi.”
substring search
overlapping occurrences
string indexing
case sensitive matching
Tetris Game in Python with Pygame
2 months ago
Complete Python code for a Pygame-based classic Tetris fulfilling all gameplay, controls, scoring, and UI requirements.
python tetris game
pygame tetris implementation
classic tetris gameplay
tetromino game development

Popular Models10

deepseek-chat-v3-0324
deepseek
DeepSeek V3, a 685B-parameter, mixture-of-experts model, is the latest iteration of the flagship chat model family from the DeepSeek team. It succeeds the [DeepSeek V3](/deepseek/deepseek-chat-v3) model and performs really well on a variety of tasks.
Input:$0.25 / 1M tokens
Output:$0.85 / 1M tokens
Context:163840 tokens
text
text
deepseek-r1-0528
deepseek
May 28th update to the [original DeepSeek R1](/deepseek/deepseek-r1) Performance on par with [OpenAI o1](/openai/o1), but open-sourced and with fully open reasoning tokens. It's 671B parameters in size, with 37B active in an inference pass. Fully open-source model.
Input:$0.27 / 1M tokens
Output:$0.27 / 1M tokens
Context:163840 tokens
text
text
gemini-1.5-pro-latest
gemini
Highest intelligence Gemini 1.5 series model, with a breakthrough 2 million token context window.
Input:$1.25 / 1M tokens
Output:$5 / 1M tokens
Context:2M tokens
tools
text
image
audio
video
text
qwen3-235b-a22b-2507
qwen
Qwen3-235B-A22B-Instruct-2507 is a multilingual, instruction-tuned mixture-of-experts language model based on the Qwen3-235B architecture, with 22B active parameters per forward pass. It is optimized for general-purpose text generation, including instruction following, logical reasoning, math, code, and tool usage. The model supports a native 262K context length and does not implement "thinking mode" (<think> blocks). Compared to its base variant, this version delivers significant gains in knowledge coverage, long-context reasoning, coding benchmarks, and alignment with open-ended tasks. It is particularly strong on multilingual understanding, math reasoning (e.g., AIME, HMMT), and alignment evaluations like Arena-Hard and WritingBench.
Input:$0.12 / 1M tokens
Output:$0.12 / 1M tokens
Context:262144 tokens
text
text
deepseek-chat
deepseek
DeepSeek-Chat is an advanced conversational AI model designed to provide intelligent
Input:$0.14 / 1M tokens
Output:$0.28 / 1M tokens
Context:64K tokens
tools
text
text
deepseek-r1
deepseek
DeepSeek R1 is here: Performance on par with [OpenAI o1](/openai/o1), but open-sourced and with fully open reasoning tokens. It's 671B parameters in size, with 37B active in an inference pass. Fully open-source model & [technical report](https://api-docs.deepseek.com/news/news250120). MIT licensed: Distill & commercialize freely!
Input:$0.4 / 1M tokens
Output:$2 / 1M tokens
Context:163840 tokens
text
text
claude-sonnet-4
anthropic
Our high-performance model with exceptional reasoning and efficiency
Input:$3 / 1M tokens
Output:$15 / 1M tokens
Context:200K tokens
tools
text
image
text
claude-3.7-sonnet
anthropic
Intelligent model, with visible step‑by‑step reasoning
Input:$3 / 1M tokens
Output:$15 / 1M tokens
Context:200K tokens
tools
text
text
image
gpt-4o-mini
openai
GPT-4o mini (o for omni) is a fast, affordable small model for focused tasks. It accepts both text and image inputs, and produces text outputs (including Structured Outputs). It is ideal for fine-tuning, and model outputs from a larger model like GPT-4o can be distilled to GPT-4o-mini to produce similar results at lower cost and latency.The knowledge cutoff for GPT-4o-mini models is October, 2023.
Input:$0.15 / 1M tokens
Output:$0.6 / 1M tokens
Context:128K tokens
tools
text
image
text
gpt-4o
openai
High-intelligence flagship model for complex, multi-step tasks. GPT-4o is cheaper and faster than GPT-4 Turbo. It is multimodal (accepting text or image inputs and outputting text), and it has the same high intelligence as GPT-4 Turbo but is much more efficient—it generates text 2x faster and is 50% cheaper. Additionally, GPT-4o has the best vision and performance across non-English languages of any of our models. GPT-4o is available in the OpenAI API to paying customers.
Input:$2.5 / 1M tokens
Output:$10 / 1M tokens
Context:128K tokens
tools
text
image
text