Extracts web content using the Jina.ai Reader API.
This repository is no longer maintained.
The functionality of this tool is now available in mcp-omnisearch, which combines multiple MCP tools in one unified package.
Please use mcp-omnisearch instead.
A Model Context Protocol (MCP) server for integrating Jina.ai's Reader API with LLMs. This server provides efficient and comprehensive web content extraction capabilities, optimized for documentation and web content analysis.
This server requires configuration through your MCP client. Here are examples for different environments:
Add this to your Cline MCP settings:
{
  "mcpServers": {
    "jinaai-reader": {
      "command": "npx",
      "args": ["-y", "mcp-jinaai-reader"],
      "env": {
        "JINAAI_API_KEY": "your-jinaai-api-key"
      }
    }
  }
}
For WSL environments, add this to your Claude Desktop configuration:
{
  "mcpServers": {
    "jinaai-reader": {
      "command": "wsl.exe",
      "args": [
        "bash",
        "-c",
        "JINAAI_API_KEY=your-jinaai-api-key npx mcp-jinaai-reader"
      ]
    }
  }
}
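Both configurations above hand `JINAAI_API_KEY` to the server process, either through the `env` block or inline in the shell command. As a minimal sketch of what a startup check for that variable could look like in a Node process (the helper name `requireApiKey` and the error text are hypothetical and not taken from this server's source):

```typescript
// Hypothetical helper: reads JINAAI_API_KEY from an environment map and
// fails fast with a clear message when it is missing or blank.
function requireApiKey(env: Record<string, string | undefined>): string {
  const raw = env["JINAAI_API_KEY"];
  const key = raw === undefined ? "" : raw.trim();
  if (key === "") {
    throw new Error("JINAAI_API_KEY environment variable is required");
  }
  return key;
}
```

In an actual entry point this would presumably be called as `requireApiKey(process.env)`, so that a missing key surfaces when the MCP client launches the server instead of on the first tool call.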
The server requires the following environment variable:
JINAAI_API_KEY: Your Jina.ai API key (required)
The server implements a single MCP tool with configurable parameters:
Convert any URL to LLM-friendly text using Jina.ai Reader.
Parameters:
url (string, required): URL to process
no_cache (boolean, optional): Bypass cache for fresh results. Defaults to false
format (string, optional): Response format ("json" or "stream"). Defaults to "json"
timeout (number, optional): Maximum time in seconds to wait for webpage load
target_selector (string, optional): CSS selector to focus on specific elements
wait_for_selector (string, optional): CSS selector to wait for specific elements
remove_selector (string, optional): CSS selector to exclude specific elements
with_links_summary (boolean, optional): Gather all links at the end of response
with_images_summary (boolean, optional): Gather all images at the end of response
with_generated_alt (boolean, optional): Add alt text to images lacking captions
with_iframe (boolean, optional): Include iframe content in response
For local development:
npm install
npm run build
npm run dev
To publish a new version:
npm run build
npm publish
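Returning to the tool parameters listed earlier, the sketch below shows one way they could be translated into a Jina.ai Reader request. It is an illustration under stated assumptions, not this server's actual code: the names `ReadUrlArgs`, `ReaderRequest`, and `buildReaderRequest` are hypothetical, and the `X-` header names follow Jina.ai's public Reader documentation as best understood here, so they should be checked against that documentation before use.

```typescript
// Argument shape mirroring the documented tool parameters.
interface ReadUrlArgs {
  url: string;
  no_cache?: boolean;
  format?: "json" | "stream";
  timeout?: number;
  target_selector?: string;
  wait_for_selector?: string;
  remove_selector?: string;
  with_links_summary?: boolean;
  with_images_summary?: boolean;
  with_generated_alt?: boolean;
  with_iframe?: boolean;
}

interface ReaderRequest {
  endpoint: string;
  headers: Record<string, string>;
}

// Hypothetical mapping from tool arguments to a Reader request.
// Nothing is sent over the network; the caller decides how to fetch.
function buildReaderRequest(args: ReadUrlArgs, apiKey: string): ReaderRequest {
  if (!/^https?:\/\//i.test(args.url)) {
    throw new Error("url must start with http:// or https://");
  }

  const headers: Record<string, string> = {
    Authorization: `Bearer ${apiKey}`,
    // "json" is the documented default response format.
    Accept: args.format === "stream" ? "text/event-stream" : "application/json",
  };

  // Optional flags are only sent when explicitly set (assumed behavior).
  if (args.no_cache) headers["X-No-Cache"] = "true";
  if (args.timeout !== undefined) headers["X-Timeout"] = String(args.timeout);
  if (args.target_selector) headers["X-Target-Selector"] = args.target_selector;
  if (args.wait_for_selector) headers["X-Wait-For-Selector"] = args.wait_for_selector;
  if (args.remove_selector) headers["X-Remove-Selector"] = args.remove_selector;
  if (args.with_links_summary) headers["X-With-Links-Summary"] = "true";
  if (args.with_images_summary) headers["X-With-Images-Summary"] = "true";
  if (args.with_generated_alt) headers["X-With-Generated-Alt"] = "true";
  if (args.with_iframe) headers["X-With-Iframe"] = "true";

  // The Reader endpoint takes the target URL appended to its base address.
  return { endpoint: `https://r.jina.ai/${args.url}`, headers };
}
```

Keeping the mapping in a pure function like this makes it easy to unit test without an API key or network access; the result can then be passed to `fetch(endpoint, { headers })` or any other HTTP client.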
Contributions are welcome! Please feel free to submit a Pull Request.
MIT License - see the LICENSE file for details.