mercury by openrouter - AI Model Details, Pricing, and Performance Metrics

inception
mercury
inception

mercury

completions
byopenrouter

Mercury is the first diffusion large language model (dLLM). Applying a breakthrough discrete diffusion approach, the model runs 5-10x faster than even speed optimized models like GPT-4.1 Nano and Claude 3.5 Haiku while matching their performance. Mercury's speed enables developers to provide responsive user experiences, including with voice agents, search interfaces, and chatbots. Read more in the blog post here.

Context
128K
Input
$0.25 / 1M tokens
Output
$1 / 1M tokens
Capabilities: tools
Accepts: text
Returns: text

Access mercury through LangDB AI Gateway

Recommended

Integrate with inception's mercury and 250+ other models through a unified API. Monitor usage, control costs, and enhance security.

Unified API
Cost Optimization
Enterprise Security
Get Started Now

Free tier available • No credit card required

Instant Setup
99.9% Uptime
10,000+Monthly Requests

Code Examples

Integration samples and API usage