mimo-v2-omni by openrouter - AI Model Details, Pricing, and Performance Metrics

xiaomi
mimo-v2-omni
Try
xiaomi

mimo-v2-omni

completions
byopenrouter

MiMo-V2-Omni is a frontier omni-modal model that natively processes image, video, and audio inputs within a unified architecture. It combines strong multimodal perception with agentic capability - visual grounding, multi-step planning, tool use, and code execution - making it well-suited for complex real-world tasks that span modalities, 256K context window.

Released
Mar 18, 2026
Knowledge
Sep 19, 2025
License
Proprietary
Context
262144
Input
$0.4 / 1M tokens
Output
$2 / 1M tokens
Cached
$0.08 / 1M tokens
Capabilities: tools, reasoning
Accepts: text, image
Returns: text

Access mimo-v2-omni through LangDB AI Gateway

Recommended

Integrate with xiaomi's mimo-v2-omni and 250+ other models through a unified API. Monitor usage, control costs, and enhance security.

Unified API
Cost Optimization
Enterprise Security
Get Started Now

Free tier available • No credit card required

Instant Setup
99.9% Uptime
10,000+Monthly Requests

Category Scores

Benchmark Tests

View Other Benchmarks
HLE
19.9
General Knowledge
GPQA
82.8
STEM (Physics, Chemistry, Biology)
SciCode
36.7
Scientific
AA Coding Index
35.5
Programming
AAII
43.4
General

Code Examples

Integration samples and API usage