Doc Scraper MCP Server

Public
askjohngeorge/mcp-doc-scraper

Converts web-based documentation into markdown format using jina.ai's conversion service, allowing users to scrape documentation from any URL and save it as markdown files.

Verified
python
0 tools
May 29, 2025
Updated May 30, 2025

Doc Scraper MCP Server

A Model Context Protocol (MCP) server that provides documentation scraping functionality. This server converts web-based documentation into markdown format using jina.ai's conversion service.

Features

  • Scrapes documentation from any web URL
  • Converts HTML documentation to markdown format
  • Saves the converted documentation to a specified output path
  • Integrates with the Model Context Protocol (MCP)
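
The features above could be sketched roughly as follows. This is a minimal illustration, not the server's actual code. It assumes the conversion works by prefixing the target URL with Jina's Reader endpoint (https://r.jina.ai/), and the names build_reader_url, Fetcher, and the injected fetch callable are hypothetical. The function reuses the tool name scrape_docs purely for readability. The fetcher is passed in so the flow can be exercised without network access; a real implementation would likely perform the request with aiohttp, which the project lists as a dependency.

```python
from pathlib import Path
from typing import Awaitable, Callable

# Assumption: Jina's Reader service returns markdown for a page when the
# page URL is appended to this prefix.
JINA_READER_PREFIX = "https://r.jina.ai/"

# Hypothetical type for an async function that downloads a URL as text.
Fetcher = Callable[[str], Awaitable[str]]


def build_reader_url(url: str) -> str:
    """Return the (assumed) conversion URL for a documentation page."""
    if not url.startswith(("http://", "https://")):
        raise ValueError(f"expected an http(s) URL, got {url!r}")
    return JINA_READER_PREFIX + url


async def scrape_docs(url: str, output_path: str, fetch: Fetcher) -> Path:
    """Fetch markdown for `url` via `fetch` and write it to `output_path`."""
    markdown = await fetch(build_reader_url(url))
    target = Path(output_path)
    # Create missing parent directories so the save step does not fail.
    target.parent.mkdir(parents=True, exist_ok=True)
    target.write_text(markdown, encoding="utf-8")
    return target
```

Injecting the fetcher keeps the save logic independent of any particular HTTP client, which also makes it straightforward to test with a stub.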

Installation

Installing via Smithery

To install Doc Scraper for Claude Desktop automatically via Smithery:

npx -y @smithery/cli install @askjohngeorge/mcp-doc-scraper --client claude

Manual Installation

  1. Clone the repository:
git clone https://github.com/askjohngeorge/mcp-doc-scraper.git
cd mcp-doc-scraper
  2. Create and activate a virtual environment:
python -m venv venv
source venv/bin/activate  # On Windows, use: venv\Scripts\activate
  3. Install the dependencies:
pip install -e .

Usage

The server can be run using Python:

python -m mcp_doc_scraper

Tool Description

The server provides a single tool:

  • Name: scrape_docs
  • Description: Scrape documentation from a URL and save as markdown
  • Input Parameters:
    • url: The URL of the documentation to scrape
    • output_path: The path where the markdown file should be saved
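
For reference, an MCP client invokes a tool by sending a JSON-RPC 2.0 request with the method tools/call. The sketch below builds such a message for scrape_docs using only the standard library. The helper name build_tool_call, the request id, and the example URL and path are illustrative assumptions; in practice an MCP client library or host application would normally construct and send this message for you.

```python
import json


def build_tool_call(url: str, output_path: str, request_id: int = 1) -> str:
    """Serialize a JSON-RPC 2.0 request that calls the scrape_docs tool."""
    request = {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {
            "name": "scrape_docs",
            # The two input parameters described above.
            "arguments": {"url": url, "output_path": output_path},
        },
    }
    return json.dumps(request)


if __name__ == "__main__":
    # Example values only; substitute a real documentation URL and path.
    print(build_tool_call("https://example.com/docs", "docs/example.md"))
```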

Project Structure

doc_scraper/
├── __init__.py
├── __main__.py
└── server.py

Dependencies

  • aiohttp
  • mcp
  • pydantic

Development

To set up the development environment:

  1. Install development dependencies:
pip install -r requirements.txt
  2. The server uses the Model Context Protocol, so familiarize yourself with the MCP documentation before making changes.

License

MIT License
