WebWatch MCP
Website change monitoring with URL watching, snapshot comparison, and intelligent content extraction for AI assistants
🔍 WebWatch MCP
A Model Context Protocol (MCP) server for monitoring website changes. Track URLs, detect content changes, compare snapshots, and extract content with CSS selectors — all from your AI assistant.
Features
- Watch URLs — Monitor any webpage for content changes
- CSS Selectors — Track specific elements (prices, headlines, stock info)
- Snapshot History — SQLite-backed storage of every page state
- Diff Detection — See exactly what changed, line by line
- Content Extraction — Pull structured data from any page on demand
Tools
| Tool | Description |
|---|---|
watch_url | Add a URL to the watch list with optional CSS selector and label |
list_watches | List all monitored URLs with status and statistics |
check_now | Fetch a URL now, compare to previous snapshot, return diff |
get_changes | Browse change history for one or all watched URLs |
compare_snapshots | Compare any two snapshots side-by-side |
extract_content | One-off content extraction with CSS selectors |
Installation
# Clone and build
git clone https://github.com/leviai-tools/webwatch-mcp.git
cd webwatch-mcp
npm install
npm run build
# Or install globally
npm install -g @leviai/webwatch-mcp
Usage with Claude Desktop
Add to your claude_desktop_config.json:
{
"mcpServers": {
"webwatch": {
"command": "node",
"args": ["/path/to/webwatch-mcp/build/index.js"],
"env": {
"WEBWATCH_DATA_DIR": "/path/to/data/dir"
}
}
}
}
Or if installed globally:
{
"mcpServers": {
"webwatch": {
"command": "webwatch-mcp"
}
}
}
Environment Variables
| Variable | Default | Description |
|---|---|---|
WEBWATCH_DATA_DIR | ~/.webwatch-mcp | Directory for SQLite database |
Example Workflows
Monitor a pricing page
You: "Watch https://example.com/pricing — track the .pricing-table element"
→ watch_url(url, selector=".pricing-table", label="Example pricing")
You: "Check if pricing changed"
→ check_now("https://example.com/pricing")
Track competitor changes
You: "Add these competitor pages to my watchlist"
→ watch_url for each URL
You: "Show me all recent changes"
→ get_changes(limit=20)
Extract data on demand
You: "What are the headlines on HN right now?"
→ extract_content("https://news.ycombinator.com", ".titleline > a")
Data Storage
All data is stored locally in SQLite at ~/.webwatch-mcp/webwatch.db (configurable via WEBWATCH_DATA_DIR).
Tables:
watches— Monitored URLs with selectors and labelssnapshots— Page content captured at each checkchanges— Detected diffs between consecutive snapshots
Requirements
- Node.js ≥ 18.0.0
License
MIT
相关服务器
Bright Data
赞助Discover, extract, and interact with the web - one interface powering automated access across the public internet.
MCP Web Scraper
A production-ready web scraping platform with ML-powered automation, browser automation via Playwright, and persistent caching.
LinkedIn
Scrape LinkedIn profiles, companies, and jobs using direct URLs. Features Claude AI integration and secure credential storage.
MyBrowserAPI
A browser API for interacting with web services like X, Reddit, ChatGPT, and WhatsApp using Puppeteer.
B2Proxy
1GB Free Trial, World's Leading Proxy Service Platform, Efficient Data Collection
Read Website Fast
Fast, token-efficient web content extraction that converts websites to clean Markdown. Features Mozilla Readability, smart caching, polite crawling with robots.txt support, and concurrent fetching with minimal dependencies.
Browser Use MCP Server
An MCP server that allows AI agents to control a web browser using the browser-use library.
MCP URL Format Converter
Fetches content from any URL and converts it to HTML, JSON, Markdown, or plain text.
MCP URL2SNAP
A lightweight MCP server that captures screenshots of any URL and returns the image URL. Requires an AbstractAPI key.
JCrawl4AI
A Java-based MCP server for interacting with the Crawl4ai web scraping API.
BrowserLoop
Take screenshots and read console logs from web pages using Playwright.