Transform PDFs, Word docs, presentations, and images into AI-ready formats using MinerU’s parsing API - optimized for Claude Code research workflows with 90%+ accuracy, 109 languages, and batch processing up to 200 documents
MinerU MCP integrates MinerU’s document parsing API directly into Claude, enabling AI-powered document analysis without leaving your research workflow.
Created by: Xule Lin | Version: 1.1.3 | GitHubKey Stats: 90%+ accuracy (VLM mode) | 109 languages | Up to 200 documents per batch | 73% token reductionPerfect for: Systematic literature reviews, batch PDF processing, research corpus preparation
Part of the Ecosystem: MinerU is available as an optional MCP in the Interpretive Orchestration Plugin, powering high-accuracy PDF parsing for qualitative research workflows!
An MCP server that wraps MinerU’s document parsing API, optimized for Claude Code research workflows. Instead of switching between tools or running scripts, you can parse documents directly within your AI conversation.Why use MinerU MCP instead of manual conversion?
Integrated workflow: Parse documents without leaving Claude
Download processed results as named markdown files to a local directory.
Parameter
Description
Default
batch_id
Batch ID to download results for (required)
-
output_dir
Local directory for output files (required)
-
overwrite
Overwrite existing files
false
Local File Workflow: Tools 5 and 6 enable a complete local pipeline — upload files from your machine with mineru_upload_batch, poll with mineru_batch_status, then save results with mineru_download_results. No URLs or manual downloads needed.
I have 47 papers from my Scopus search that need to be convertedto markdown for analysis. Here are the URLs:[paste URLs]Use VLM mode for accurate table extraction. This is for mysystematic literature review on organizational learning.
MinerU MCP is the recommended approach for PDF conversion in Research Memex workflows. See the PDF to Markdown Conversion Guide for comparison with other methods.
MinerU is bundled as an optional MCP in the Interpretive Orchestration Plugin for qualitative research. It powers document ingestion alongside Markdownify for a complete document processing pipeline.
Meta-Moment: This MCP was created to solve a real problem: batch processing PDFs for systematic literature reviews without context-switching. Now it’s documented in the same Research Memex that inspired its creation.The tools shape the methodology, and the methodology shapes the tools.