Provides documentation scraping by converting web-based docs from any URL into markdown format using Model Context Protocol integration for seamless content extraction and storage.
Unlock the full potential of Doc Scraper MCP Server through LangDB's AI Gateway. Get enterprise-grade security, analytics, and seamless integration with zero configuration.
Free tier available • No credit card required
A Model Context Protocol (MCP) server that provides documentation scraping functionality. This server converts web-based documentation into markdown format using jina.ai's conversion service.
To install Doc Scraper for Claude Desktop automatically via Smithery:
npx -y @smithery/cli install @askjohngeorge/mcp-doc-scraper --client claude
git clone https://github.com/askjohngeorge/mcp-doc-scraper.git cd mcp-doc-scraper
python -m venv venv source venv/bin/activate # On Windows, use: venv\Scripts\activate
pip install -e .
The server can be run using Python:
python -m mcp_doc_scraper
The server provides a single tool:
scrape_docs
url
: The URL of the documentation to scrapeoutput_path
: The path where the markdown file should be saveddoc_scraper/
├── __init__.py
├── __main__.py
└── server.py
To set up the development environment:
pip install -r requirements.txt
MIT License
Discover shared experiences
Shared threads will appear here, showcasing real-world applications and insights from the community. Check back soon for updates!