omniparser-autogui-mcp

omniparser-autogui-mcp

Public
NON906/omniparser-autogui-mcp

Analyzes screen content using OmniParser and automates GUI operations via Model Context Protocol, supporting customizable window targeting and remote processing for enhanced workflow automation on Windows.

python
0 tools
May 30, 2025
Updated Jun 4, 2025

Supercharge Your AI with omniparser-autogui-mcp

MCP Server

Unlock the full potential of omniparser-autogui-mcp through LangDB's AI Gateway. Get enterprise-grade security, analytics, and seamless integration with zero configuration.

Unified API Access
Complete Tracing
Instant Setup
Get Started Now

Free tier available • No credit card required

Instant Setup
99.9% Uptime
10,000+Monthly Requests
Configuration Requirements
none
Configure authentication and required variables to access this MCP server
Required Environment Variables
OMNI_PARSER_BACKEND_LOAD
Optional
string

Set to 1 if it does not work with other clients (such as LibreChat)

OMNI_PARSER_SERVER
Optional
string

Address and port of the server if you want OmniParser processing to be done on another device (e.g. 127.0.0.1:8000)

PYTHONIOENCODING
Optional
string

Python IO encoding setting

Default: utf-8
SSE_HOST
Optional
string

Host for SSE communication instead of stdio

SSE_PORT
Optional
string

Port for SSE communication instead of stdio

CAPTION_MODEL_PATH
Optional
string

Path to caption model for OmniParser configuration

OCR_LANG
Optional
string

Language setting for OCR

Default: en
BOX_TRESHOLD
Optional
string

Box threshold setting for OmniParser configuration

SOM_MODEL_PATH
Optional
string

Path to SOM model for OmniParser configuration

OMNI_PARSER_DEVICE
Optional
string

Device setting for OmniParser configuration

CAPTION_MODEL_NAME
Optional
string

Caption model name for OmniParser configuration

Security Notice

Your environment variables and credentials are securely stored and encrypted. LangDB never shares these configuration values with third parties.

Related MCPs5
  • MCP Windows Desktop Automation

    Model Context Protocol server enabling Windows desktop automation with full AutoIt function integration, supporting mouse, keyboard, window, process, and system operations, plus file access, screenshots, and automation prompt templates via stdio or WebSocket transport.

    Added May 30, 2025
  • cloudflare-browser-rendering-mcp

    Provides Model Context Protocol tools for fetching, processing, summarizing, and extracting structured web content using Cloudflare Browser Rendering, supporting enhanced LLM context and documentation search capabilities.

    5 tools
    Added May 29, 2025
  • Computer Control MCP

    Provides Model Context Protocol (MCP) capabilities for computer control including mouse and keyboard automation, screen capture with OCR, window management, and drag-and-drop operations using PyAutoGUI, RapidOCR, and ONNXRuntime without external dependencies.

    Added May 29, 2025
  • DocGen MCP Server

    Automates standardized documentation generation from GitHub and Google Drive sources using templates, supports multiple file types, tracks document history, and integrates AI-enhanced content via Perplexity within the Model Context Protocol framework.

    3 tools
    Added May 30, 2025
  • PlayCanvas Editor MCP Server

    Automates the PlayCanvas Editor using an LLM via Model Context Protocol, offering comprehensive tools for managing entities, assets, scenes, and store content with enhanced integration through Claude or Cursor clients.

    Added May 30, 2025