Markdown Downloader
Download webpages as markdown files using the r.jina.ai service, with configurable directories and persistent settings.
Markdown Downloader MCP Server
Overview
Markdown Downloader is an MCP (Model Context Protocol) server that downloads webpages as markdown files. It relies on the r.jina.ai service to convert web content into markdown.
Features
- 🌐 Download webpages as markdown using r.jina.ai
- 📁 Configurable download directory
- 📝 Automatically generates date-stamped filenames
- 🔍 List downloaded markdown files
- 💾 Persistent configuration
Prerequisites
- Node.js (version 16 or higher)
- npm (Node Package Manager)
Installation
Installing via Smithery
To install Markdown Downloader for Claude Desktop automatically via Smithery:
npx -y @smithery/cli install @dazeb/markdown-downloader --client claude
Installing manually
1. Clone the repository:
git clone https://github.com/your-username/markdown-downloader.git
cd markdown-downloader
2. Install dependencies:
npm install
3. Build the project:
npm run build
Manually Add Server to Cline/Roo-Cline MCP Settings file
Linux/macOS
{
  "mcpServers": {
    "markdown-downloader": {
      "command": "node",
      "args": [
        "/home/user/Documents/Cline/MCP/markdown-downloader/build/index.js"
      ],
      "disabled": false,
      "alwaysAllow": [
        "download_markdown",
        "set_download_directory"
      ]
    }
  }
}
Windows
{
  "mcpServers": {
    "markdown-downloader": {
      "command": "node",
      "args": [
        "C:\\Users\\username\\Documents\\Cline\\MCP\\markdown-downloader\\build\\index.js"
      ],
      "disabled": false,
      "alwaysAllow": [
        "download_markdown",
        "set_download_directory"
      ]
    }
  }
}
Tools and Usage
1. Set Download Directory
Change the download directory:
use set_download_directory /path/to/your/local/download/folder
- Validates directory exists and is writable
- Persists the configuration for future use
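The two checks above (the directory exists and is writable, then the setting is persisted) can be sketched roughly as follows. This is a minimal illustration, not the server's actual code: the function name `setDownloadDirectory`, the `DownloaderConfig` shape, and the `downloadDirectory` key are all assumptions.

```typescript
import * as fs from "node:fs";
import * as path from "node:path";

// Hypothetical config shape; the real config.json may use different keys.
interface DownloaderConfig {
  downloadDirectory: string;
}

// Hypothetical helper: validate the directory, then write it to configPath.
function setDownloadDirectory(dir: string, configPath: string): DownloaderConfig {
  const resolved = path.resolve(dir);

  // statSync throws if the path does not exist.
  if (!fs.statSync(resolved).isDirectory()) {
    throw new Error(`Not a directory: ${resolved}`);
  }
  // accessSync throws if the current user cannot write to the directory.
  fs.accessSync(resolved, fs.constants.W_OK);

  // Persist the setting so it survives restarts.
  const config: DownloaderConfig = { downloadDirectory: resolved };
  fs.mkdirSync(path.dirname(configPath), { recursive: true });
  fs.writeFileSync(configPath, JSON.stringify(config, null, 2), "utf8");
  return config;
}
```

Validating before writing means a bad path never reaches the saved configuration, so a later download cannot fail because of a stale setting.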
2. Download Markdown
Download a webpage as a markdown file:
use tool download_markdown https://example.com/blog-post
- The URL is prefixed with r.jina.ai
- Filename format: {sanitized-url}-{date}.md
- Saved in the configured download directory
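As a rough sketch of how the request URL and the filename could be derived: the helper names, the exact sanitization rule, and the YYYY-MM-DD date format below are assumptions for illustration, and the server's real output may differ.

```typescript
// Hypothetical helpers; names and rules are illustrative assumptions.

/** Build the reader URL by prefixing the target URL with the r.jina.ai host. */
function toJinaUrl(targetUrl: string): string {
  return `https://r.jina.ai/${targetUrl}`;
}

/** Assumed rule: drop the scheme, lowercase, turn unsafe runs into single dashes. */
function sanitizeUrl(targetUrl: string): string {
  return targetUrl
    .replace(/^https?:\/\//i, "") // drop the scheme
    .toLowerCase()
    .replace(/[^a-z0-9]+/g, "-") // collapse anything unsafe into one dash
    .replace(/^-+|-+$/g, ""); // trim leading and trailing dashes
}

/** Compose "{sanitized-url}-{date}.md" using a UTC date (format assumed). */
function markdownFilename(targetUrl: string, now: Date = new Date()): string {
  const date = now.toISOString().slice(0, 10);
  return `${sanitizeUrl(targetUrl)}-${date}.md`;
}
```

Under these assumptions, `https://example.com/blog-post` would be fetched from `https://r.jina.ai/https://example.com/blog-post` and saved under a name like `example-com-blog-post-2024-01-15.md`.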
3. List Downloaded Files
List all downloaded markdown files:
use list_downloaded_files
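Conceptually, listing amounts to scanning the configured directory for markdown files. The sketch below assumes a hypothetical `listDownloadedFiles` helper that returns sorted filenames; the actual tool may return other fields or a different order.

```typescript
import * as fs from "node:fs";
import * as path from "node:path";

// Hypothetical helper: return the names of regular files ending in .md, sorted.
function listDownloadedFiles(downloadDir: string): string[] {
  return fs
    .readdirSync(downloadDir, { withFileTypes: true })
    .filter((entry) => entry.isFile() && path.extname(entry.name).toLowerCase() === ".md")
    .map((entry) => entry.name)
    .sort();
}
```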
4. Get Download Directory
Retrieve the current download directory:
use get_download_directory
Configuration
Linux/macOS
- Configuration is stored in ~/.config/markdown-downloader/config.json
- Default download directory: ~/.markdown-downloads
Windows
- Configuration is stored in %APPDATA%\markdown-downloader\config.json
- Default download directory: %USERPROFILE%\Documents\markdown-downloads
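The per-platform locations above could be resolved along these lines. This is a sketch under stated assumptions: `resolveLocations` and the `Locations` type are hypothetical names, and the fallback for a missing `%APPDATA%` value is an assumption rather than documented behavior.

```typescript
// Hypothetical result type for the two documented locations.
interface Locations {
  configFile: string;
  defaultDownloadDir: string;
}

// Hypothetical helper. `platform` follows Node's process.platform values;
// `home` is the user's home directory (%USERPROFILE% on Windows).
function resolveLocations(platform: string, home: string, appData?: string): Locations {
  if (platform === "win32") {
    // Assumption: fall back to the conventional Roaming folder if APPDATA is unset.
    const base = appData ?? `${home}\\AppData\\Roaming`;
    return {
      configFile: `${base}\\markdown-downloader\\config.json`,
      defaultDownloadDir: `${home}\\Documents\\markdown-downloads`,
    };
  }
  // Linux and macOS share the same layout in this README.
  return {
    configFile: `${home}/.config/markdown-downloader/config.json`,
    defaultDownloadDir: `${home}/.markdown-downloads`,
  };
}
```

Passing the platform and directories in as arguments, instead of reading them from the environment inside the function, keeps the path logic easy to check on any operating system.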
Troubleshooting
- Ensure you have an active internet connection
- Check that the URL is valid and accessible
- Verify write permissions for the download directory
Security
- The tool sends each requested URL to the r.jina.ai service, which fetches the page and returns markdown content
- Local files are saved with sanitized filenames
- Files are saved in the configured download directory, which you can change
Contributing
Contributions are welcome! Please feel free to submit a Pull Request.
License
This project is licensed under the MIT License. See the LICENSE file for details.
Disclaimer
This tool is provided as-is. Always review downloaded content for accuracy and appropriateness.
Support
For issues or feature requests, please open an issue on the GitHub repository.
