Automate browser interactions in the cloud (e.g. web navigation, data extraction, form filling, and more)
The Model Context Protocol (MCP) is an open protocol that enables seamless integration between LLM applications and external data sources and tools. Whether youโre building an AI-powered IDE, enhancing a chat interface, or creating custom AI workflows, MCP provides a standardized way to connect LLMs with the context they need.
This server provides cloud browser automation capabilities using Browserbase, Puppeteer, and Stagehand. This server enables LLMs to interact with web pages, take screenshots, and execute JavaScript in a cloud browser environment.
To learn to get started with Browserbase, check out Browserbase MCP or Stagehand MCP.
๐ Browserbase MCP - Located in browserbase/
Feature | Description |
---|---|
Browser Automation | Control and orchestrate cloud browsers |
Data Extraction | Extract structured data from any webpage |
Console Monitoring | Track and analyze browser console logs |
Screenshots | Capture full-page and element screenshots |
JavaScript | Execute custom JS in the browser context |
Web Interaction | Navigate, click, and fill forms with ease |
๐ค Stagehand MCP - Located in stagehand/
Feature | Description |
---|---|
Atomic Instructions | Execute precise actions like act("click the login button") or extract("find the red shoes") |
Model Flexibility | Supports multiple models, including OpenAI's GPT-4 and Anthropic's Claude-3.7 Sonnet |
Modular Design | Easily integrate new models with minimal changes |
Vision Support | Use annotated screenshots for complex DOMs |
Open Source | Contribute to the project and join the Slack community for support |
Web content fetching and conversion for efficient LLM usage
Browser automation and web scraping
Enable AI agents to get structured data from unstructured web with AgentQL.
Actors MCP Server: Use 3,000+ pre-built cloud tools to extract data from websites, e-commerce, social media, search engines, maps, and more
Extract web data with Firecrawl
Hyperbrowser is the next-generation platform empowering AI agents and enabling effortless, scalable browser automation.
Scrape websites with Oxylabs Web API, supporting dynamic rendering and parsing for structured data extraction.
Playwright MCP server
Render website screenshots with ScreenshotOne
Automate your local browser