Mozilla Readability Parser

Extracts and transforms webpage content into clean, LLM-optimized Markdown using Mozilla's Readability algorithm.

Mozilla Readability Parser MCP Server

An model context protocol (MCP) server that extracts and transforms webpage content into clean, LLM-optimized Markdown. Returns article title, main content, excerpt, byline and site name. Uses Mozilla's Readability algorithm to remove ads, navigation, footers and non-essential elements while preserving the core content structure. More about MCP.

Mozilla Readability Parser Server MCP server

Features

Removes ads, navigation, footers and other non-essential content
Converts clean HTML into well-formatted Markdown (also uses Turndown)
Returns article metadata (title, excerpt, byline, site name)
Handles errors gracefully

Why Not Just Fetch?

Unlike simple fetch requests, this server:

Extracts only relevant content using Mozilla's Readability algorithm
Eliminates noise like ads, popups, and navigation menus
Reduces token usage by removing unnecessary HTML/CSS
Provides consistent Markdown formatting for better LLM processing
Includes useful metadata about the content

Installation

Installing via Smithery

To install Mozilla Readability Parser for Claude Desktop automatically via Smithery:

npx -y @smithery/cli install server-moz-readability --client claude

Manual Installation

npm install server-moz-readability

Tool Reference

`parse`

Fetches and transforms webpage content into clean Markdown.

Arguments:

{ "url": { "type": "string", "description": "The website URL to parse", "required": true } }

Returns:

{ "title": "Article title", "content": "Markdown content...", "metadata": { "excerpt": "Brief summary", "byline": "Author information", "siteName": "Source website name" } }

Usage with Claude Desktop

Add to your claude_desktop_config.json:

{ "mcpServers": { "readability": { "command": "npx", "args": ["-y", "server-moz-readability"] } } }

Dependencies

@mozilla/readability - Content extraction
turndown - HTML to Markdown conversion
jsdom - DOM parsing
axios - HTTP requests

License

MIT

Related Servers

Bright Data

sponsor

Discover, extract, and interact with the web - one interface powering automated access across the public internet.

Crypto News MCP Server

Fetches the latest cryptocurrency news and converts article content from HTML to Markdown.

MCP Deep Web Research Server

An advanced web research server with intelligent search queuing, enhanced content extraction, and deep research capabilities.

Playwright Server

A server for browser automation using the Playwright library.

just-every/mcp-screenshot-website-fast

High-quality screenshot capture optimized for Claude Vision API. Automatically tiles full pages into 1072x1072 chunks (1.15 megapixels) with configurable viewports and wait strategies for dynamic content.

Playwright Server

Automate web browsers and perform web scraping tasks using the Playwright framework.

WebDriverIO

Automate web browsers using WebDriverIO. Supports actions like clicking, filling forms, and taking screenshots.

MCP Browser Use Secure

A secure MCP server for browser automation with enhanced security features like multi-layered protection and session isolation.

Scrapling Fetch MCP

Fetches HTML and markdown from websites with anti-automation measures using Scrapling.

Nefino

Access the Nefino renewable energy news API.

MCP-Puppeteer-Linux

Automate web browsers on Linux using Puppeteer. Enables LLMs to interact with web pages, take screenshots, and execute JavaScript.