CodingBaby Browser
A Node.js server that enables AI assistants to control the Chrome browser via WebSocket. Requires the CodingBaby Chrome Extension.
CodingBaby-Browser-MCP
Overview
CodingBaby-Browser-MCP is a powerful tool that enables AI assistants like Claude 3.7 Sonnet in Cursor to control Chrome browser for automated tasks. This tool bridges the gap between AI and web browser interaction through a WebSocket-based communication protocol.
Features
- Browser Automation: Control Chrome browser programmatically to navigate websites, fill forms, and perform clicks
- Screenshot Capture: Take screenshots of entire pages or specific areas
- Multi-tab Support: Create, list, select, and close browser tabs
- Form Interaction: Type text, press keys, and select form elements
- Batch Commands: Execute multiple browser operations in sequence
- Viewport Control: Adjust browser window size for responsive testing
Architecture
The project consists of two main components:
- MCP Tool Server: A Node.js server that implements the Model Context Protocol (MCP) to communicate with AI assistants in Cursor
- Chrome Extension: A browser extension that receives commands from the MCP server and controls the browser
The system uses WebSocket (port 9876 by default) to establish a bidirectional communication channel between the MCP server and the Chrome extension.
Installation
Prerequisites
- Node.js (v14 or higher)
- Chrome browser
- Cursor editor with Claude 3.7 Sonnet
MCP Tool Setup
- In Cursor, go to Settings → MCP
- Add new global MCP server with the following configuration:
{
"mcpServers": {
"CodingBaby-Browser-MCP": {
"command": "npx",
"args": ["@sydneyassistent/codingbaby-browser-mcp"]
}
}
}
Chrome Extension Setup
- Install the CodingBaby Extension from the Chrome Web Store
- Enable the extension and ensure it has the necessary permissions
Usage
Once installed, you can ask Claude 3.7 in Cursor to control your browser:
Use the browser to navigate to https://example.com
Available Commands
navigate: Open a URL in the browserclick: Click on elementstype: Enter text in form fieldspressKey: Simulate keyboard actionsscroll: Scroll in any directiontakeScreenshot: Capture browser contentwait: Pause execution for specified timesetViewport: Change browser window dimensionstabNew,tabList,tabSelect,tabClose: Tab managementbatch: Execute multiple commands in sequenceclose: Close the browser session
Development and Debugging
If you've downloaded the source code, you can set up the project for development and debugging purposes.
Debugging Chrome Extension
To load and debug the Chrome extension from source code:
- Open Chrome and navigate to
chrome://extensions/ - Enable "Developer mode" by toggling the switch in the top-right corner
- Click "Load unpacked" button
- Navigate to the
chrome-extensiondirectory in the project and select it - The extension will now be loaded in developer mode
- You can view console logs by right-clicking the extension icon, selecting "Inspect" and opening the Console tab
- Make changes to the extension code and click the refresh icon on the extension card to apply changes
Debugging MCP Server in Cursor
To use the local MCP server code for debugging:
- Clone or download the repository to your local machine
- Navigate to the project directory and install dependencies:
cd Browser-MCP npm install - In Cursor, go to Settings → MCP
- Add a new MCP server with the following configuration:
Replace{ "mcpServers": { "CodingBaby-Browser-MCP-Dev": { "command": "node", "args": ["/absolute/path/to/your/Browser-MCP/index.js"] } } }/absolute/path/to/your/with the actual path to the downloaded project - Click "Refresh" to load the MCP server
- You can now make changes to the MCP server code and restart the server by clicking "Refresh" in Cursor's MCP settings
For debugging, you can:
- Check the Cursor MCP logs by clicking on the MCP status icon
- Add
console.error()statements to the code for more detailed logging - Run the MCP server manually from the terminal for full console output
Troubleshooting
- Port Conflict: If port 9876 is already in use, the tool will attempt to release it automatically
- Connection Issues: Ensure the Chrome extension is properly installed and enabled
License
MIT
Links
- Official Website: www.codingbaby.fun
- NPM Package: @sydneyassistent/codingbaby-browser-mcp
- Chrome Extension: CodingBaby Extension
相关服务器
Bright Data
赞助Discover, extract, and interact with the web - one interface powering automated access across the public internet.
Conduit
Headless browser with SHA-256 hash-chained audit trails and Ed25519-signed proof bundles. MCP server for AI agents.
Chrome Debug
Automate Chrome via its debugging port with session persistence. Requires Chrome to be started with remote debugging enabled.
Browserbase
Automate browser interactions in the cloud (e.g. web navigation, data extraction, form filling, and more)
freesound-mcp
A Model Context Protocol (MCP) server that enables AI applications to search and download audio resources from the Freesound platform via natural language commands.
Plain Markdown
Convert any URL to clean, LLM-ready Markdown
SearchMCP
Connect any LLM to the internet with the cheapest, most reliable, and developer-friendly search API.
Bilibili
Interact with the Bilibili video website, enabling actions like searching for videos, retrieving video information, and accessing user data.
Hacker News
Fetches and parses stories from Hacker News, providing structured data for top, new, ask, show, and job posts.
Steel Puppeteer
Provides browser automation capabilities using Puppeteer and Steel, configurable for local or cloud instances.
CompanyScope MCP
Company intelligence in one tool call — funding, tech stack, employees, competitors, news from public APIs