A PDF processing server that extracts text via normal parsing or OCR, and retrieves images from PDF files through the MCP protocol with a built-in web debugger.
A PDF file reading server based on FastMCP.
Supports PDF text extraction, OCR recognition, and image extraction via the MCP protocol, with a built-in web debugger for easy testing.
read_pdf_text: Extracts normal text from a PDF (page by page).
read_by_ocr: Uses OCR to recognize text from scanned or image-based PDFs.
read_pdf_images: Extracts all images from a specified PDF page (Base64 encoded output).
mcp-pdf-server/
├── pdf_resources/ # Directory for uploaded and processed PDF files
├── txt_server.py # Main server entry point
└── README.md # Project documentation
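As an illustration of how a `file_path` argument could be mapped onto this layout, here is a minimal sketch. It assumes relative paths are looked up inside `pdf_resources/` and absolute paths are used unchanged. `resolve_pdf_path` and `PDF_DIR` are hypothetical names for this example, not part of `txt_server.py`.

```python
from pathlib import Path

# Assumed location of the PDF directory, relative to the working directory.
PDF_DIR = Path("pdf_resources")


def resolve_pdf_path(file_path: str, base_dir: Path = PDF_DIR) -> Path:
    """Return an absolute path for file_path (hypothetical helper).

    Absolute paths are returned unchanged; relative paths are resolved
    inside base_dir.
    """
    path = Path(file_path)
    if path.is_absolute():
        return path
    # Accept both "example.pdf" and "pdf_resources/example.pdf".
    if path.parts and path.parts[0] == base_dir.name:
        path = Path(*path.parts[1:])
    return (base_dir / path).resolve()
```

With this sketch, `resolve_pdf_path("example.pdf")` and `resolve_pdf_path("pdf_resources/example.pdf")` point to the same file.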
Recommended Python version: 3.9+
pip install pymupdf mcp
Note: To use OCR features, you may need a MuPDF build with OCR support or external OCR libraries.
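To confirm the installation before starting the server, a short check like the following may help. It relies on PyMuPDF being importable as `fitz` and the MCP SDK as `mcp`. `missing_modules` is a hypothetical helper written for this example, and it does not verify OCR support.

```python
import importlib.util


def missing_modules(names: list[str]) -> list[str]:
    """Return the module names from `names` that cannot be found."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    # PyMuPDF is importable as "fitz"; the MCP SDK as "mcp".
    missing = missing_modules(["fitz", "mcp"])
    print("Missing modules:", ", ".join(missing) if missing else "none")
```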
Run the following command:
python txt_server.py
You should see logs like:
Serving on http://127.0.0.1:6231
Open your browser and visit:
http://127.0.0.1:6231
No coding required: debug and test easily via the web UI.
Tool | Description | Input Parameters | Returns |
---|---|---|---|
read_pdf_text | Extracts normal text from PDF pages | file_path, start_page, end_page | List of page texts |
read_by_ocr | Recognizes text via OCR | file_path, start_page, end_page, language, dpi | OCR extracted text |
read_pdf_images | Extracts images from a PDF page | file_path, page_number | List of images (Base64 encoded) |
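The `start_page` and `end_page` parameters appear to be 1-based and inclusive, judging by the usage examples in this README; that is an assumption, not something the README states. PyMuPDF itself indexes pages from 0, so a server would need a conversion along these lines. `page_indices` is a hypothetical helper for illustration only.

```python
def page_indices(start_page: int, end_page: int, page_count: int) -> list[int]:
    """Convert a 1-based inclusive page range to 0-based page indices.

    Assumes start_page/end_page are 1-based and inclusive.
    """
    if start_page < 1 or end_page < start_page:
        raise ValueError(f"invalid page range: {start_page}-{end_page}")
    if end_page > page_count:
        raise ValueError(f"end_page {end_page} exceeds page count {page_count}")
    # range() excludes its stop value, so end_page itself is the right bound.
    return list(range(start_page - 1, end_page))
```

Rejecting out-of-range values early gives the caller a clear error instead of a failure deep inside the PDF library.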
Extract text from pages 1 to 5:
mcp run read_pdf_text --args '{"file_path": "pdf_resources/example.pdf", "start_page": 1, "end_page": 5}'
Perform OCR recognition on page 1:
mcp run read_by_ocr --args '{"file_path": "pdf_resources/example.pdf", "start_page": 1, "end_page": 1, "language": "eng"}'
Extract all images from page 3:
mcp run read_pdf_images --args '{"file_path": "pdf_resources/example.pdf", "page_number": 3}'
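Since `read_pdf_images` returns Base64 encoded images, a client will usually want to decode them back into files. The sketch below assumes the result is a plain list of Base64 strings and that the images are PNG or JPEG; neither detail is confirmed by this README. `save_images` and `guess_extension` are hypothetical helpers.

```python
import base64
from pathlib import Path

PNG_MAGIC = b"\x89PNG\r\n\x1a\n"
JPEG_MAGIC = b"\xff\xd8\xff"


def guess_extension(data: bytes) -> str:
    """Pick a file extension from the leading bytes of the image data."""
    if data.startswith(PNG_MAGIC):
        return ".png"
    if data.startswith(JPEG_MAGIC):
        return ".jpg"
    return ".bin"  # unknown format: keep the raw bytes


def save_images(encoded_images: list[str], out_dir: str, prefix: str = "image") -> list[Path]:
    """Decode Base64 strings and write each one to out_dir.

    Assumes each list item is one Base64 encoded image.
    """
    target = Path(out_dir)
    target.mkdir(parents=True, exist_ok=True)
    saved = []
    for index, encoded in enumerate(encoded_images, start=1):
        data = base64.b64decode(encoded)
        path = target / f"{prefix}_{index}{guess_extension(data)}"
        path.write_bytes(data)
        saved.append(path)
    return saved
```

If the server wraps each image in a richer structure (for example a dictionary with format metadata), only the line that extracts the Base64 string would need to change.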
PDF files must be placed in the pdf_resources/ directory, or an absolute path must be provided.
This project is licensed under the MIT License.
For commercial use, please credit the original source.