PDF (By Anthropic) MCP Server
Anthropic's PDF Agent Skill for reading, extracting, creating, merging, splitting, watermarking, OCR, and form-filling PDF documents.
PDF is an official Anthropic Agent Skill (distributed via the anthropics/skills repository) that teaches Claude how to perform a wide range of PDF operations. It is not a traditional MCP server with a fixed tool list; it is a skill package (a SKILL.md file plus reference docs and helper scripts) that Claude loads dynamically to handle PDF related tasks. In Claude Code it is installed as part of the document-skills plugin, which sits in the same plugin/MCP ecosystem alongside other connectors.
The skill covers essentially the full PDF lifecycle: extracting text and tables (via pdfplumber), basic operations like merging, splitting, rotating, and metadata edits (via pypdf), authoring new PDFs (via reportlab), OCR on scanned documents (via pytesseract and pdf2image), image extraction (pdfimages), watermarking, password protection, and PDF form filling (covered in a dedicated FORMS.md reference). It also documents command line equivalents using qpdf, pdftk, and pdftotext.
What makes it notable is that it is the same skill family that powers Claude's production document features on Claude.ai paid plans, source-available rather than fully open source. Because it works by instructing Claude how to use existing Python and CLI tools rather than running a hosted service, it works fully offline once dependencies are installed.
Tools
| Tool | Description |
|---|---|
extract_text |
Extract plain or layout-preserving text from PDFs using pdfplumber or pdftotext. |
extract_tables |
Pull tables from PDFs into structured data (pandas/Excel) using pdfplumber. |
merge_pdfs |
Combine multiple PDFs into one document using pypdf or qpdf. |
split_pdf |
Split a PDF into individual pages or page ranges. |
rotate_pages |
Rotate one or more pages by 90/180/270 degrees. |
add_watermark |
Overlay a watermark PDF onto another PDF using pypdf merge_page. |
create_pdf |
Generate new PDFs from scratch with reportlab, including multi-page Platypus layouts. |
fill_form |
Programmatically fill PDF AcroForm fields (see FORMS.md for guidance). |
encrypt_pdf |
Apply password protection / encryption to a PDF. |
decrypt_pdf |
Remove password protection from a PDF using qpdf. |
extract_images |
Extract embedded images from a PDF using pdfimages. |
ocr_pdf |
Run OCR on scanned PDFs using pytesseract and pdf2image to produce searchable text. |
Prerequisites
- Claude Code (or Claude.ai with paid plan, or Claude API with Skills support)
- Python 3 with the libraries used by the skill:
pypdf,pdfplumber,reportlab,pdf2image,pytesseract - Optional CLI tools:
qpdf,pdftk,poppler-utils(pdftotext,pdfimages),tesseract
Install in Claude Code (recommended)
The PDF skill ships as part of the document-skills plugin from Anthropic's official skills marketplace:
/plugin marketplace add anthropics/skills
/plugin install document-skills@anthropic-agent-skills
Once installed, invoke it by mentioning PDF work in your prompt, for example:
Use the PDF skill to extract the form fields from ./contract.pdf
Manual install / inspection
Clone the repo to read or vendor the skill directly:
git clone https://github.com/anthropics/skills.git
cd skills/skills/pdf
Files of interest:
SKILL.md(main instructions and YAML frontmatter)FORMS.md(PDF form filling guidance)reference.md(extended API reference)scripts/(helper scripts)
Claude.ai
Built-in for paid plans. Custom variants can be uploaded under Settings > Capabilities > Skills.
Claude API
Use the Skills API to register the skill. See the Anthropic Skills guide at docs.claude.com/en/api/skills-guide.
Note
This is an Agent Skill, not a traditional standalone MCP server. It augments Claude with a SKILL.md plus helper scripts rather than exposing a long-lived JSON-RPC server.
- Extract structured tables from invoices, bank statements, or financial reports into pandas or Excel for downstream analysis.
- Auto-fill recurring PDF forms (W-9, NDAs, intake paperwork) given a JSON payload of field values.
- Merge a generated cover letter, signed contract, and supporting exhibits into a single delivery-ready PDF.
- Run OCR on scanned legal or medical PDFs to make them searchable, then extract specific clauses or data points.
- Generate branded multi-page PDF reports with reportlab from query results or Markdown content.
- "Use the PDF skill to extract all tables from
q3-financials.pdfand save them as separate sheets in an Excel file." - "Merge
cover.pdf,contract.pdf, andexhibits.pdfinto one file calledsigned-package.pdf." - "Fill the form fields in
w9-template.pdfwith the values invendor.jsonand save asacme-w9.pdf." - "OCR the scanned PDF
lease.pdf, then find every clause that mentions a renewal date." - "Create a 5-page PDF report from this Markdown summary, with a title page and page numbers."
- Maintained by Anthropic and source-available in the official
anthropics/skillsrepo, so it tracks what Claude actually uses in production. - Broad coverage: extraction, creation, merging, splitting, OCR, encryption, watermarks, and form filling in one skill.
- Works fully locally using standard Python and CLI tooling, with no external API calls or cloud costs.
- Includes dedicated supplementary docs (
FORMS.md,reference.md) for the tricky cases.
- It is an Agent Skill, not a long-running MCP server, so it does not expose a fixed JSON-RPC tool surface and behavior depends on Claude following the instructions.
- License is proprietary ("source-available"), so it cannot be freely redistributed or modified like an MIT/Apache MCP server.
- Requires Python plus optional CLI dependencies (
tesseract,qpdf,poppler) to be installed on the host for full functionality.
- Markdownify MCP: converts PDFs, DOCX, PPTX, HTML, and YouTube transcripts into Markdown via an MCP server.
- Docling MCP: IBM's document parsing toolkit with a community MCP server for PDF to Markdown conversion.
- Desktop Commander: general-purpose MCP server with file operations across PDF, DOCX, Excel, and more.