Mastering MCP Servers
MinerU MCP: AI-Powered Document Parsing
Transform PDFs, Word docs, presentations, and images into AI-ready formats using MinerU's parsing API - optimized for Claude Code research workflows with 90%+ accuracy, 109 languages, and batch processing up to 200 documents
MinerU MCP integrates MinerU's document parsing API directly into Claude, enabling AI-powered document analysis without leaving your research workflow.
Info
Created by: Xule Lin | Version: 1.1.3 | GitHub
Key Stats: 90%+ accuracy (VLM mode) | 109 languages | Up to 200 documents per batch | 73% token reduction
Perfect for: Systematic literature reviews, batch PDF processing, research corpus preparation
Tip
Part of the Ecosystem: MinerU is available as an optional MCP in the Interpretive Orchestration Plugin, powering high-accuracy PDF parsing for qualitative research workflows!
What is MinerU MCP?
An MCP server that wraps MinerU's document parsing API, optimized for Claude Code research workflows. Instead of switching between tools or running scripts, you can parse documents directly within your AI conversation.
Why use MinerU MCP instead of manual conversion?
- Integrated workflow: Parse documents without leaving Claude
- Multi-format support: PDF, DOC, DOCX, PPT, PPTX, PNG, JPG, JPEG
- Batch processing: Handle 200 documents simultaneously
- Local file workflow: Upload files from your machine, poll for completion, download results
- Quality options: Choose speed (Pipeline) or accuracy (VLM)
- 73% token reduction: Optimized tool descriptions for efficient context usage
The Six Tools
MinerU MCP provides six tools covering two workflows: URL-based parsing (tools 1--4) and a local file pipeline (tools 5--6).
1. mineru_parse
Process a single document with customizable options.
| Parameter | Description | Default |
|---|---|---|
url | Document URL (required) | - |
model | pipeline (fast) or vlm (accurate) | pipeline |
pages | Page ranges to parse (e.g. "1-10,15") | all pages |
formats | Extra export formats beyond markdown | - |
ocr | Enable OCR for scanned documents | false |
formula | Recognize mathematical/chemical formulas | false |
table | Detect and extract tables | true |
language | OCR language (109 supported) | en |
Example prompt:
Parse pages 1-25 of this paper with VLM mode for maximum accuracy:
https://arxiv.org/pdf/2401.12345.pdf2. mineru_status
Check task completion and get download URLs.
| Parameter | Description | Default |
|---|---|---|
task_id | Task ID from a parse request (required) | - |
format | concise or detailed response | concise |
Example prompt:
Check the status of my parsing job and download the markdown when ready3. mineru_batch
Process multiple document URLs simultaneously — perfect for SLR corpus preparation.
Limits:
- Maximum 200 documents per batch
- 200MB per file, 600 pages per document
- 2000 pages/day at high priority
Example prompt:
Batch process these 50 papers using VLM mode for my literature review:
[list of URLs]4. mineru_batch_status
Retrieve paginated results from batch jobs.
| Parameter | Description | Default |
|---|---|---|
batch_id | Batch ID from a batch request (required) | - |
limit | Number of results to return | - |
offset | Pagination offset | 0 |
format | concise or detailed response | concise |
5. mineru_upload_batch
Upload local files from your machine for batch processing — no need to host files at a URL.
| Parameter | Description | Default |
|---|---|---|
directory | Path to a folder of documents | - |
files | Array of specific file paths | - |
model | pipeline (fast) or vlm (accurate) | pipeline |
formula | Recognize formulas | false |
table | Detect and extract tables | true |
language | OCR language | en |
formats | Extra export formats | - |
Provide either directory or files (not both).
6. mineru_download_results
Download processed results as named markdown files to a local directory.
| Parameter | Description | Default |
|---|---|---|
batch_id | Batch ID to download results for (required) | - |
output_dir | Local directory for output files (required) | - |
overwrite | Overwrite existing files | false |
Tip
Local File Workflow: Tools 5 and 6 enable a complete local pipeline — upload files from your machine with mineru_upload_batch, poll with mineru_batch_status, then save results with mineru_download_results. No URLs or manual downloads needed.
VLM Mode vs Pipeline Mode
Best for: Academic papers, complex layouts, tables, formulas
- 90%+ accuracy using Vision Language Models
- Slower processing (worth the wait for important documents)
- Higher API cost
- Recommended for SLR corpus where accuracy matters
Parse with model='vlm' for maximum accuracyUse Cases for Research
1. SLR Corpus Preparation
Converting 50+ papers for systematic review:
I have 47 papers from my Scopus search that need to be converted
to markdown for analysis. Here are the URLs:
[paste URLs]
Use VLM mode for accurate table extraction. This is for my
systematic literature review on organizational learning.2. Local File Processing
When your papers are already downloaded (e.g., from Zotero):
Upload all PDFs in ~/Documents/slr-papers/ using VLM mode,
then download the results to ~/Documents/slr-markdown/3. Batch Processing for Literature Analysis
Screen a large set before detailed analysis:
Quick parse these 100 papers using pipeline mode to extract
abstracts and key sections. I'll do detailed VLM parsing
on the 20 most relevant ones later.4. Multilingual Research
MinerU supports 109 OCR languages:
Parse this German-language paper with OCR enabled and
language set to 'de'. Extract the methodology section.Installation & Setup
Step 1: Get API Key
- Visit mineru.net
- Create account and generate API key
- Save securely (you'll need it for configuration)
Step 2: Install MCP
claude mcp add mineru-mcp -e MINERU_API_KEY=your-api-key -- npx -y mineru-mcpVerify with claude mcp list — you should see mineru-mcp available.
Info
MinerU MCP supports 11+ client configurations including Windsurf, Cline, Cherry Studio, and Witsy. See the full setup guide on GitHub for all options.
Configuration Options
| Variable | Default | Purpose |
|---|---|---|
MINERU_API_KEY | Required | Bearer token from mineru.net |
MINERU_BASE_URL | https://mineru.net/api/v4 | API endpoint |
MINERU_DEFAULT_MODEL | pipeline | Default parsing mode |
Integration with Research Memex
With OCR Guide
MinerU MCP is the recommended approach for PDF conversion in Research Memex workflows. See the PDF to Markdown Conversion Guide for comparison with other methods.
With SLR Workflow
Use MinerU for batch PDF processing in your Systematic Literature Review workflow. Perfect for converting your Zotero exports to AI-ready markdown.
With Interpretive Orchestration
MinerU is bundled as an optional MCP in the Interpretive Orchestration Plugin for qualitative research. It powers document ingestion alongside Markdownify for a complete document processing pipeline.
MinerU vs Mistral OCR
| Feature | MinerU MCP | Mistral OCR (Script) |
|---|---|---|
| Integration | MCP (inline in Claude) | Python script |
| Best for | Claude workflows, real-time | Bulk offline processing |
| Formats | PDF, DOC, DOCX, PPT, PPTX, images | PDF only |
| Batch limit | 200 docs | Unlimited |
| VLM mode | Yes (90%+) | No |
| Local files | Yes (upload_batch) | Yes |
| Languages | 109 | Variable |
| Setup | API key + MCP | API key + Python |
| Cost | Per-page API | Per-page API |
Recommendation: Use MinerU MCP for integrated Claude workflows and multi-format documents. Use Mistral script for very large offline batch jobs.
Limitations & Considerations
- API key required — Get from mineru.net
- File size: 200MB max per file
- Page limit: 600 pages per document
- Daily quota: 2000 pages at high priority
- VLM mode: More accurate but slower and costlier
Resources
- GitHub: linxule/mineru-mcp
- npm: mineru-mcp
- Smithery: Install for any AI client
- MinerU Platform: mineru.net
- MinerU Open Source: opendatalab/MinerU
- Related: OCR Guide | SLR Workflow