Back to Opendataloader Pdf

OpenDataLoader PDF MCP Server

python/opendataloader-pdf-mcp/README.md

2.4.21.8 KB
Original Source

OpenDataLoader PDF MCP Server

MCP (Model Context Protocol) server for OpenDataLoader PDF.

Enables AI agents to convert PDFs to Markdown, JSON, HTML, and more via MCP.

Prerequisites

  • Java 11+
  • Python 3.10+

Installation

bash
pip install opendataloader-pdf-mcp

Usage

Claude Desktop

Add to your Claude Desktop config (claude_desktop_config.json):

json
{
  "mcpServers": {
    "opendataloader-pdf": {
      "command": "uvx",
      "args": ["opendataloader-pdf-mcp"]
    }
  }
}

Claude Code

bash
claude mcp add opendataloader-pdf -- uvx opendataloader-pdf-mcp

OpenAI Codex

bash
codex --mcp-config mcp.json

mcp.json:

json
{
  "mcpServers": {
    "opendataloader-pdf": {
      "command": "uvx",
      "args": ["opendataloader-pdf-mcp"]
    }
  }
}

Cursor

Add to .cursor/mcp.json in your project:

json
{
  "mcpServers": {
    "opendataloader-pdf": {
      "command": "uvx",
      "args": ["opendataloader-pdf-mcp"]
    }
  }
}

Windsurf

Add to ~/.codeium/windsurf/mcp_config.json:

json
{
  "mcpServers": {
    "opendataloader-pdf": {
      "command": "uvx",
      "args": ["opendataloader-pdf-mcp"]
    }
  }
}

Other MCP Clients

Any MCP-compatible client can use this server. The command is:

bash
uvx opendataloader-pdf-mcp

Tools

convert_pdf

Convert a PDF file to the specified format.

Parameters:

  • input_path (required): Path to the input PDF file
  • format: Output format — json, text, html, markdown (default), markdown-with-html, markdown-with-images
  • pages: Pages to extract (e.g., "1,3,5-7")
  • password: Password for encrypted PDFs
  • All other OpenDataLoader PDF options are supported

License

Apache-2.0