Skip to main content
MinerU MCP integrates MinerU’s document parsing API directly into Claude, enabling AI-powered document analysis without leaving your research workflow.
Created by: Xule LinKey Stats: 90%+ accuracy (VLM mode) | 109 languages | Up to 200 documents per batch | 73% token reductionPerfect for: Systematic literature reviews, batch PDF processing, research corpus preparation
Part of the Ecosystem: MinerU is available as an optional MCP in the Interpretive Orchestration Plugin, powering high-accuracy PDF parsing for qualitative research workflows!

What is MinerU MCP?

An MCP server that wraps MinerU’s document parsing API, optimized for Claude Code research workflows. Instead of switching between tools or running scripts, you can parse documents directly within your AI conversation. Why use MinerU MCP instead of manual conversion?
  • Integrated workflow: Parse PDFs without leaving Claude
  • Batch processing: Handle 200 documents simultaneously
  • Quality options: Choose speed (Pipeline) or accuracy (VLM)
  • 73% token reduction: Optimized tool descriptions for efficient context usage

The Four Tools

1. mineru_parse

Process a single document with customizable options.
ParameterDescriptionDefault
urlDocument URL (required)-
modelpipeline (fast) or vlm (accurate)pipeline
ocrEnable OCR for scanned documentsfalse
formulaRecognize mathematical/chemical formulasfalse
tableDetect and extract tablestrue
languageOCR language (109 supported)en
Example prompt:
Parse this paper with VLM mode for maximum accuracy:
https://arxiv.org/pdf/2401.12345.pdf

2. mineru_status

Check task completion and get download URLs. Example prompt:
Check the status of my parsing job and download the markdown when ready

3. mineru_batch

Process multiple documents simultaneously - perfect for SLR corpus preparation. Limits:
  • Maximum 200 documents per batch
  • 200MB per file, 600 pages per document
  • 2000 pages/day at high priority
Example prompt:
Batch process these 50 papers using VLM mode for my literature review:
[list of URLs]

4. mineru_batch_status

Retrieve paginated results from batch jobs.

VLM Mode vs Pipeline Mode

Best for: Academic papers, complex layouts, tables, formulas
  • 90%+ accuracy using Vision Language Models
  • Slower processing (worth the wait for important documents)
  • Higher API cost
  • Recommended for SLR corpus where accuracy matters
Parse with model='vlm' for maximum accuracy

Use Cases for Research

1. SLR Corpus Preparation

Converting 50+ papers for systematic review:
I have 47 papers from my Scopus search that need to be converted
to markdown for analysis. Here are the URLs:
[paste URLs]

Use VLM mode for accurate table extraction. This is for my
systematic literature review on organizational learning.

2. Batch Processing for Literature Analysis

Screen a large set before detailed analysis:
Quick parse these 100 papers using pipeline mode to extract
abstracts and key sections. I'll do detailed VLM parsing
on the 20 most relevant ones later.

3. Multilingual Research

MinerU supports 109 OCR languages:
Parse this German-language paper with OCR enabled and
language set to 'de'. Extract the methodology section.

Installation & Setup

Step 1: Get API Key

  1. Visit mineru.net
  2. Create account and generate API key
  3. Save securely (you’ll need it for configuration)

Step 2: Install MCP

claude mcp add mineru-mcp -e MINERU_API_KEY=your-api-key -- npx mineru-mcp
Verify with claude mcp list - you should see mineru-mcp available.

Configuration Options

VariableDefaultPurpose
MINERU_API_KEYRequiredBearer token from mineru.net
MINERU_BASE_URLhttps://mineru.net/api/v4API endpoint
MINERU_DEFAULT_MODELpipelineDefault parsing mode

Integration with Research Memex

With OCR Guide

MinerU MCP is the recommended approach for PDF conversion in Research Memex workflows. See the PDF to Markdown Conversion Guide for comparison with other methods.

With SLR Workflow

Use MinerU for batch PDF processing in your Systematic Literature Review workflow. Perfect for converting your Zotero exports to AI-ready markdown.

With Interpretive Orchestration

MinerU is bundled as an optional MCP in the Interpretive Orchestration Plugin for qualitative research. It powers document ingestion alongside Markdownify for a complete document processing pipeline.

MinerU vs Mistral OCR

FeatureMinerU MCPMistral OCR (Script)
IntegrationMCP (inline in Claude)Python script
Best forClaude workflows, real-timeBulk offline processing
Batch limit200 docsUnlimited
VLM modeYes (90%+)No
Languages109Variable
SetupAPI key + MCPAPI key + Python
CostPer-page APIPer-page API
Recommendation: Use MinerU MCP for integrated Claude workflows. Use Mistral script for very large offline batch jobs.

Limitations & Considerations

  • API key required - Get from mineru.net
  • File size: 200MB max per file
  • Page limit: 600 pages per document
  • Daily quota: 2000 pages at high priority
  • VLM mode: More accurate but slower and costlier

Resources


Meta-Moment: This MCP was created to solve a real problem: batch processing PDFs for systematic literature reviews without context-switching. Now it’s documented in the same Research Memex that inspired its creation.The tools shape the methodology, and the methodology shapes the tools. 📄→🔄→📚

← Back to Advanced Topics