© 2026 LeanZero. All rights reserved.

MCP Server

MCP Doc Processor

A Model Context Protocol server that reads, analyzes, creates, and edits PDF, DOCX, and Excel files. Give your AI agent full document intelligence with 10 specialized tools.

PDF / DOCX / Excel
Vision OCR
Document Generation
7 Style Presets

Read & Analyze Any Document

Extract text, understand structure, and answer questions about PDF, DOCX, and Excel files - including scanned documents via vision-based OCR.

PDF Processing

Native text extraction for digital PDFs. Vision-based OCR for scanned documents with AI-powered post-processing that fixes broken words, spacing, and character errors.

DOCX & Excel

Rich text extraction from Word documents preserving formatting and embedded images. Multi-sheet Excel parsing with full cell value extraction.

Focused Analysis

Don't just dump text - ask specific questions. The focused query tool lets AI agents interrogate documents for exactly the information they need.

Create & Edit Documents

Generate professional Word documents and Excel spreadsheets directly from AI conversations. Edit existing files without losing formatting.

DOCX Generation

  • Titles, paragraphs with 3 heading levels, and tables
  • Headers and footers with automatic page numbers
  • Custom background colors and page styling
  • Inline code shading and fenced code blocks
  • 7 professional style presets

Excel Generation

  • Multiple sheets with structured data
  • Column widths, row heights, and font styling
  • Formatted headers with background colors
  • Same 7 style presets with optimized Excel colors
  • Append rows, add sheets, or replace data

DOCX XML Patching

Unlike naive approaches that rebuild documents from scratch, the edit-doc tool works directly with the XML parts inside the DOCX file, which is a ZIP archive. Because only the affected XML is changed, edits preserve the original formatting: headers, footers, images, custom styles, and complex layouts remain intact when appending or replacing content.
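To illustrate the idea of patching XML in place, here is a minimal sketch that appends one paragraph to the contents of word/document.xml as a string. It is not the edit-doc implementation: the function names are hypothetical, unzipping and re-zipping the DOCX is left out, and a real tool would more likely use an XML parser than string search.

```javascript
// Escape text so it is safe inside an XML element.
function escapeXml(text) {
  return text
    .replace(/&/g, "&amp;")
    .replace(/</g, "&lt;")
    .replace(/>/g, "&gt;");
}

// Append a plain paragraph to a word/document.xml string (hypothetical helper).
// The new <w:p> is placed before the body-level <w:sectPr> (page setup) when
// one exists, otherwise right before </w:body>. Nothing else is rewritten.
function appendParagraph(documentXml, text) {
  const paragraph =
    `<w:p><w:r><w:t xml:space="preserve">${escapeXml(text)}</w:t></w:r></w:p>`;

  const bodyEnd = documentXml.lastIndexOf("</w:body>");
  if (bodyEnd === -1) throw new Error("no </w:body> found");

  // Find the last <w:sectPr ...> before </w:body> (not <w:sectPrChange>).
  let sectStart = -1;
  for (const m of documentXml.slice(0, bodyEnd).matchAll(/<w:sectPr(?=[\s>\/])/g)) {
    sectStart = m.index;
  }

  // Only treat it as body-level if nothing but whitespace follows it.
  const tail = sectStart === -1 ? "" : documentXml.slice(sectStart, bodyEnd).trimEnd();
  const isBodyLevel =
    tail.endsWith("</w:sectPr>") || /^<w:sectPr\b[^>]*\/>$/.test(tail);

  const insertAt = isBodyLevel ? sectStart : bodyEnd;
  return documentXml.slice(0, insertAt) + paragraph + documentXml.slice(insertAt);
}
```

Because every byte outside the insertion point is copied through unchanged, existing styles, headers, and image references in the part are left as they were.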

10 MCP Tools

Every capability is exposed as a Model Context Protocol tool that any MCP-compatible AI agent can use.

get-doc-summary

High-level document overview with structure, sections, and embedded images

get-doc-indepth

Full text extraction with formatting, metadata, and embedded images

get-doc-focused

Targeted analysis answering specific questions about a document

create-doc

Create Word documents with headings, tables, headers, footers, and 7 style presets

create-excel

Create Excel workbooks with multiple sheets, styling, and formatted headers

edit-doc

Edit DOCX files via XML patching - preserves original formatting, headers, and images

edit-excel

Append rows, add sheets, or replace data in existing Excel workbooks

check-document

Prevent duplicates with create/augment/replace guidance before writing

save-memory

Persist document preferences across sessions via Document DNA

delete-memory

Remove saved document preferences when no longer needed
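As a rough sketch of what calling one of these tools looks like on the wire, the snippet below builds an MCP `tools/call` request, which is a JSON-RPC 2.0 message. The tool name comes from the list above; the argument names (`filePath`, `question`) and the file path are assumptions for illustration, since the real input schema is whatever the server reports via `tools/list`.

```javascript
// Build a JSON-RPC 2.0 "tools/call" request in the shape MCP defines.
function buildToolCall(id, toolName, args) {
  return {
    jsonrpc: "2.0",
    id,
    method: "tools/call",
    params: { name: toolName, arguments: args },
  };
}

// Hypothetical arguments: check the tool's actual schema before relying on them.
const request = buildToolCall(1, "get-doc-focused", {
  filePath: "./reports/q3.pdf",
  question: "What is the total revenue?",
});

console.log(JSON.stringify(request, null, 2));
```

In practice an MCP client such as Claude Code builds and sends this message for you; the sketch only shows the envelope.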

7 Professional Style Presets

Each preset defines a complete typographic system - fonts, sizes, heading levels, spacing, justification, and table styling. Documents can also be auto-styled by category.

  • Minimal: Arial, 11pt. Everyday, clean documents
  • Professional: Garamond, 11pt. Executive summaries, formal reports
  • Technical: Arial, 11pt. API docs, specs, user manuals
  • Legal: Times New Roman, 12pt. Contracts, agreements, legal briefs
  • Business: Calibri, 11pt. Proposals, go-to-market plans
  • Casual: Verdana, 12pt. Internal comms, team updates
  • Colorful: Arial, 12pt. Presentations, marketing materials
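One way to picture the presets is as a lookup table. The sketch below encodes only the font, base size, and intended use listed above; the object shape, key names, and `getPreset` helper are assumptions, and the real presets also cover heading levels, spacing, justification, and table styling.

```javascript
// Font and base size per preset, taken from the list above.
// Property names (font, sizePt, use) are hypothetical.
const STYLE_PRESETS = {
  minimal: { font: "Arial", sizePt: 11, use: "Everyday, clean documents" },
  professional: { font: "Garamond", sizePt: 11, use: "Executive summaries, formal reports" },
  technical: { font: "Arial", sizePt: 11, use: "API docs, specs, user manuals" },
  legal: { font: "Times New Roman", sizePt: 12, use: "Contracts, agreements, legal briefs" },
  business: { font: "Calibri", sizePt: 11, use: "Proposals, go-to-market plans" },
  casual: { font: "Verdana", sizePt: 12, use: "Internal comms, team updates" },
  colorful: { font: "Arial", sizePt: 12, use: "Presentations, marketing materials" },
};

// Look up a preset by name, ignoring case. Unknown names are an error
// rather than silently falling back to a default.
function getPreset(name) {
  const preset = STYLE_PRESETS[String(name).toLowerCase()];
  if (!preset) throw new Error(`unknown style preset: ${name}`);
  return preset;
}

console.log(getPreset("Legal"));
```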

Document DNA & Intelligence

A built-in AI guidance system that learns your preferences, prevents duplicates, and automatically styles documents based on project identity.

Project DNA

A .document-dna.json config stores your company name, default style, headers, and footers. Every document inherits your brand automatically. Three-level inheritance: system defaults, project DNA, and user overrides.
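The three-level inheritance can be sketched as a layered merge in which later layers win. This is an illustration only: the field names (`companyName`, `style`, `header`) are guesses at what a .document-dna.json might contain, and the deep-merge rule is an assumption about how the levels combine.

```javascript
function isPlainObject(value) {
  return value !== null && typeof value === "object" && !Array.isArray(value);
}

// Merge layers left to right. Later layers override earlier ones;
// nested objects are merged key by key instead of being replaced whole.
function mergeLayers(...layers) {
  const result = {};
  for (const layer of layers) {
    for (const [key, value] of Object.entries(layer || {})) {
      result[key] =
        isPlainObject(value) && isPlainObject(result[key])
          ? mergeLayers(result[key], value)
          : value;
    }
  }
  return result;
}

// Hypothetical values for each level.
const systemDefaults = { style: "minimal", header: { text: "", showPageNumbers: true } };
const projectDna = { companyName: "Acme Corp", style: "business", header: { text: "Acme Corp" } };
const userOverrides = { style: "legal" };

const resolved = mergeLayers(systemDefaults, projectDna, userOverrides);
console.log(resolved);
```

Here the user override decides the style, the project supplies the header text, and the system default for page numbers survives because nothing above it changed that key.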

Category Classification

Documents are automatically categorized by keywords into contracts, technical, business, legal, meeting, or research. Each category maps to the right style preset and folder structure.
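A keyword classifier of this kind can be sketched in a few lines. The six category names come from the text above; the keyword lists, the category-to-preset mapping, and the scoring rule (most keyword hits wins) are all assumptions made for the example.

```javascript
// Hypothetical keyword lists per category.
const CATEGORY_KEYWORDS = {
  contracts: ["contract", "agreement", "nda"],
  technical: ["api", "specification", "architecture"],
  business: ["proposal", "roadmap", "go-to-market"],
  legal: ["compliance", "policy", "liability"],
  meeting: ["meeting", "agenda", "minutes"],
  research: ["research", "study", "findings"],
};

// Hypothetical mapping from category to style preset.
const CATEGORY_PRESET = {
  contracts: "legal",
  technical: "technical",
  business: "business",
  legal: "legal",
  meeting: "casual",
  research: "professional",
};

// Return the category with the most keyword hits, or null if none match.
// Matching is on whole lowercase tokens, so "api" does not match "rapid".
function classify(text) {
  const tokens = new Set(text.toLowerCase().match(/[a-z0-9-]+/g) || []);
  let best = null;
  let bestScore = 0;
  for (const [category, keywords] of Object.entries(CATEGORY_KEYWORDS)) {
    const score = keywords.filter((k) => tokens.has(k)).length;
    if (score > bestScore) {
      best = category;
      bestScore = score;
    }
  }
  return best;
}

const category = classify("Service Agreement and NDA for Acme");
console.log(category, category && CATEGORY_PRESET[category]);
```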

Document Registry

A persistent registry tracks every document created, for deduplication and discovery. Access is thread-safe, using atomic file locking for concurrent access. No more accidentally creating the same report twice.

Under the Hood

Built with the official MCP SDK in pure JavaScript. No frameworks, no build step - just a fast, focused server.

MCP Protocol

Built on @modelcontextprotocol/sdk. Works with Claude Code, LM Studio, Cline, Roo Code, and any MCP-compatible client.

Vision OCR

Two vision providers with automatic failover - local LM Studio for privacy, or Z.AI cloud for higher accuracy.
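Automatic failover between providers can be sketched as trying each one in order and returning the first success. The provider objects and their `recognize` method are hypothetical stand-ins; the snippet does not call LM Studio or Z.AI, and the order in which the server tries its providers is not stated here.

```javascript
// Try each OCR provider in order; return the first successful result.
// Each provider is assumed to expose { name, recognize(imageBytes) }.
async function ocrWithFailover(providers, imageBytes) {
  const errors = [];
  for (const provider of providers) {
    try {
      const text = await provider.recognize(imageBytes);
      return { provider: provider.name, text };
    } catch (err) {
      // Remember why this provider failed, then move on to the next one.
      errors.push(`${provider.name}: ${err.message}`);
    }
  }
  throw new Error(`all OCR providers failed: ${errors.join("; ")}`);
}

// Stub providers for demonstration only.
const localStub = {
  name: "local",
  recognize: async () => {
    throw new Error("server not running");
  },
};
const cloudStub = {
  name: "cloud",
  recognize: async () => "recognized text",
};

ocrWithFailover([localStub, cloudStub], new Uint8Array())
  .then((result) => console.log(result))
  .catch((err) => console.error(err.message));
```

Collecting the per-provider errors makes the final failure message useful when both providers are down.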

XML Patching

DOCX edits work at the XML level inside the ZIP structure, preserving all original formatting during modifications.

Atomic Operations

File locking, placeholder files, and spin-wait retry for production-grade concurrent document creation.
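The combination of lock files, placeholders, and retry can be sketched with Node's built-in `fs` module. This is a simplified illustration under stated assumptions, not the server's code: the helper names and retry settings are hypothetical, the wait is a short sleep between attempts, and stale-lock cleanup is left out.

```javascript
const fs = require("node:fs");
const os = require("node:os");
const path = require("node:path");

// Pause the current thread for `ms` milliseconds between retries.
function sleepSync(ms) {
  Atomics.wait(new Int32Array(new SharedArrayBuffer(4)), 0, 0, ms);
}

// Run `fn` while holding an exclusive lock file, retrying if it is taken.
function withFileLock(lockPath, fn, { retries = 50, delayMs = 20 } = {}) {
  for (let attempt = 0; ; attempt++) {
    try {
      // The "wx" flag fails with EEXIST if the file exists, so only one
      // process can create the lock file.
      fs.closeSync(fs.openSync(lockPath, "wx"));
      break;
    } catch (err) {
      if (err.code !== "EEXIST" || attempt >= retries) throw err;
      sleepSync(delayMs);
    }
  }
  try {
    return fn();
  } finally {
    fs.unlinkSync(lockPath); // always release the lock
  }
}

// Reserve a file name by creating an empty placeholder.
// Returns false if another writer already claimed the name.
function reservePlaceholder(filePath) {
  try {
    fs.closeSync(fs.openSync(filePath, "wx"));
    return true;
  } catch (err) {
    if (err.code === "EEXIST") return false;
    throw err;
  }
}

// Usage: claim a document name inside the lock.
const dir = fs.mkdtempSync(path.join(os.tmpdir(), "doc-lock-"));
const claimed = withFileLock(path.join(dir, "registry.lock"), () =>
  reservePlaceholder(path.join(dir, "report.docx"))
);
console.log(claimed);
```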

  • 115-140ms: text-based PDF processing
  • 10 tools: MCP server interface
  • 145+ passing tests

Open Source & Free

MIT licensed. Install it, configure your AI agent, and start reading, creating, and editing documents in minutes.
