Website Scraper MCP Server

tolik-unicornrider/mcp_scraper

Command-line tool and Model Context Protocol server that scrapes websites, extracts clean content using Mozilla's Readability, securely converts HTML to high-quality Markdown, and supports both web URLs and local HTML files.

javascript
0 tools
May 29, 2025
Updated Jun 4, 2025

Website Scraper

A command-line tool and MCP server for scraping websites and converting HTML to Markdown.

Features

  • Extracts meaningful content from web pages using Mozilla's Readability library (the same engine used in Firefox's Reader View)
  • Converts clean HTML to high-quality Markdown with TurndownService
  • Securely handles HTML by removing potentially harmful script tags
  • Works as both a command-line tool and an MCP server
  • Supports direct conversion of local HTML files to Markdown
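
As an illustration of the script-removal feature listed above, here is a minimal sketch. The helper name `stripScriptTags` and the regex-based approach are assumptions made for this example; the project's actual implementation is not shown in this README and may work differently (for instance, by removing nodes from a parsed DOM, which is generally more robust than regular expressions).

```typescript
// Hypothetical helper, not taken from the project's source.
function stripScriptTags(html: string): string {
  // Remove <script ...>...</script> pairs, case-insensitively, across newlines.
  const withoutPairs = html.replace(/<script\b[^>]*>[\s\S]*?<\/script\s*>/gi, "");
  // Remove any leftover unpaired opening or closing script tags.
  return withoutPairs.replace(/<\/?script\b[^>]*>/gi, "");
}

const dirty = '<p>Hello</p><script>alert("x")</script><p>World</p>';
console.log(stripScriptTags(dirty)); // prints: <p>Hello</p><p>World</p>
```

A regex pass like this is only suitable as a first filter; it does not handle inline event handlers or `javascript:` URLs.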

Installation

    # Install dependencies
    npm install

    # Build the project
    npm run build

    # Optionally, install globally
    npm install -g .

Usage

CLI Mode

    # Print output to console
    scrape https://example.com

    # Save output to a file
    scrape https://example.com output.md

    # Convert a local HTML file to Markdown
    scrape --html-file input.html

    # Convert a local HTML file and save output to a file
    scrape --html-file input.html output.md

    # Show help
    scrape --help

    # Or run via npm script
    npm run start:cli -- https://example.com
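
The command forms above suggest three modes: scrape a URL, convert a local HTML file, or show help, each with an optional output path. The sketch below shows one way such arguments could be parsed. The type `CliOptions` and the function `parseCliArgs` are hypothetical names for this example, and treating an empty argument list as a help request is an assumption; none of this is taken from `src/cli.ts`.

```typescript
// Hypothetical types and parser; not the project's actual CLI code.
type CliOptions =
  | { kind: "help" }
  | { kind: "url"; url: string; outputPath: string | undefined }
  | { kind: "htmlFile"; inputPath: string; outputPath: string | undefined };

function parseCliArgs(args: string[]): CliOptions {
  const [first, second, third] = args;
  // Assumption: no arguments is treated the same as --help.
  if (first === undefined || args.includes("--help")) {
    return { kind: "help" };
  }
  if (first === "--html-file") {
    if (second === undefined) {
      throw new Error("--html-file requires a path to an HTML file");
    }
    return { kind: "htmlFile", inputPath: second, outputPath: third };
  }
  // Anything else is interpreted as a URL followed by an optional output path.
  return { kind: "url", url: first, outputPath: second };
}

console.log(parseCliArgs(["--html-file", "input.html", "output.md"]));
```

Modelling the modes as a discriminated union lets the caller switch on `kind` and have the compiler check that each mode is handled.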

MCP Server Mode

This tool can be used as a Model Context Protocol (MCP) server:

    # Start in MCP server mode
    npm start
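
Once the server is running, an MCP client talks to it with JSON-RPC 2.0 messages; tools are invoked through the protocol's `tools/call` method. The sketch below builds such a request. The tool name `scrape_url` and its `url` argument are hypothetical placeholders, since this README does not list the tools the server exposes; a real client would first discover them with `tools/list`.

```typescript
// Shape of an MCP tool invocation (JSON-RPC 2.0, method "tools/call").
interface ToolCallRequest {
  jsonrpc: "2.0";
  id: number;
  method: "tools/call";
  params: { name: string; arguments: Record<string, unknown> };
}

function buildToolCall(
  id: number,
  name: string,
  args: Record<string, unknown>
): ToolCallRequest {
  return { jsonrpc: "2.0", id, method: "tools/call", params: { name, arguments: args } };
}

// "scrape_url" is an assumed tool name used only for illustration.
const request = buildToolCall(1, "scrape_url", { url: "https://example.com" });
console.log(JSON.stringify(request));
```

In practice an MCP client library handles this framing and the transport, so hand-building requests is mainly useful for debugging.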

Code Structure

  • src/index.ts - Core functionality and MCP server implementation
  • src/cli.ts - Command-line interface implementation
  • src/data_processing.ts - HTML to Markdown conversion functionality

API

The tool exports the following functions:

    // Scrape a website and convert to Markdown
    import { scrapeToMarkdown } from './build/index.js';
    // Convert HTML string to Markdown directly
    import { htmlToMarkdown } from './build/data_processing.js';

    async function example() {
      // Web scraping
      const markdown = await scrapeToMarkdown('https://example.com');
      console.log(markdown);

      // Direct HTML conversion
      const html = '<h1>Hello World</h1><p>This is <strong>bold</strong> text.</p>';
      const md = htmlToMarkdown(html);
      console.log(md);
    }

License

ISC
