MCP LLMS.txt Explorer
Explore and analyze websites that have implemented the llms.txt standard.
A Model Context Protocol server for exploring websites with llms.txt files. This server helps you discover and analyze websites that implement the llms.txt standard.
Features
Resources
- Check websites for llms.txt and llms-full.txt files
- Parse and validate llms.txt file contents
- Access structured data about compliant websites
Tools
- check_website - Check if a website has llms.txt files
  - Takes domain URL as input
  - Returns file locations and validation status
- list_websites - List known websites with llms.txt files
  - Returns structured data about compliant websites
  - Supports filtering by file type (llms.txt/llms-full.txt)
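As a rough illustration of what a check like check_website involves, the sketch below derives the two well-known file locations from a domain and applies one minimal structural check to a file's contents. The helper names (`originOf`, `candidateUrls`, `hasTitle`) and the return shape are hypothetical and are not taken from this server's source. The title check assumes the llms.txt proposal's rule that the file opens with an H1 heading.

```typescript
// Hypothetical helpers for illustration only; not this server's actual API.

type FileType = "llms.txt" | "llms-full.txt";

interface Candidate {
  fileType: FileType;
  url: string;
}

// Reduce a bare domain or a full URL to scheme + host, defaulting to https.
function originOf(input: string): string {
  const match = /^(https?:\/\/)?([^\/?#]+)/i.exec(input.trim());
  if (!match) {
    throw new Error("Not a domain or URL: " + input);
  }
  const scheme = match[1] ?? "https://";
  return scheme.toLowerCase() + match[2].toLowerCase();
}

// Build the two locations to probe at the site root.
function candidateUrls(input: string): Candidate[] {
  const origin = originOf(input);
  const fileTypes: FileType[] = ["llms.txt", "llms-full.txt"];
  return fileTypes.map((fileType) => ({ fileType, url: origin + "/" + fileType }));
}

// Minimal structural check: the first non-blank line must be an H1 ("# Title").
function hasTitle(content: string): boolean {
  const first = content.split(/\r?\n/).filter((line) => line.trim() !== "")[0];
  return first !== undefined && /^#\s+\S/.test(first);
}
```

A real check would also fetch each candidate URL and inspect the HTTP status. That step is left out here so the sketch stays free of network access.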
Development
Install dependencies:
pnpm install
Build the server:
pnpm run build
For development with auto-rebuild:
pnpm run watch
Installation
Installing via Smithery
To install mcp-llms-txt-explorer for Claude Desktop automatically via Smithery:
npx -y @smithery/cli install @thedaviddias/mcp-llms-txt-explorer --client claude
Installing Manually
To use this server:
# Clone the repository
git clone https://github.com/thedaviddias/mcp-llms-txt-explorer.git
cd mcp-llms-txt-explorer
# Install dependencies
pnpm install
# Build the server
pnpm run build
Configuration with Claude Desktop
To use with Claude Desktop, add the server config:
On macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
On Windows: %APPDATA%/Claude/claude_desktop_config.json
{
  "mcpServers": {
    "llms-txt-explorer": {
      "command": "node",
      "args": ["/path/to/llms-txt-explorer/build/index.js"]
    }
  }
}
For npx usage, you can use:
{
  "mcpServers": {
    "llms-txt-explorer": {
      "command": "npx",
      "args": ["-y", "@thedaviddias/mcp-llms-txt-explorer"]
    }
  }
}
Debugging
Since MCP servers communicate over stdio, debugging can be challenging. We recommend using the MCP Inspector, which is available as a package script:
pnpm run inspector
The Inspector will provide a URL to access debugging tools in your browser.
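To make the stdio traffic more concrete, here is a minimal sketch of the kind of message a client writes to the server's stdin when it invokes a tool. MCP uses JSON-RPC 2.0, and the stdio transport sends one JSON message per line. The helper names `buildToolCall` and `frame` are hypothetical, and the argument key `url` is an assumption; the Inspector shows the tool's actual input schema.

```typescript
// JSON-RPC 2.0 request shape that MCP uses for tool invocation.
interface ToolCallRequest {
  jsonrpc: "2.0";
  id: number;
  method: "tools/call";
  params: { name: string; arguments: Record<string, unknown> };
}

// Hypothetical helper: assemble a tools/call request.
function buildToolCall(
  id: number,
  name: string,
  args: Record<string, unknown>
): ToolCallRequest {
  return { jsonrpc: "2.0", id, method: "tools/call", params: { name, arguments: args } };
}

// Hypothetical helper: serialize to a single line, since the stdio transport
// is newline-delimited. JSON.stringify without an indent emits no raw newlines.
function frame(request: ToolCallRequest): string {
  return JSON.stringify(request) + "\n";
}

// ASSUMPTION: the argument key "url" is illustrative, not confirmed by the source.
const line = frame(buildToolCall(1, "check_website", { url: "https://example.com" }));
```

Because stdout carries the protocol messages, stray output on that stream can corrupt the exchange, which is one reason a dedicated tool such as the Inspector is easier than ad hoc logging.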
License
This project is licensed under the MIT License. See the LICENSE file for details.