SideButton

Open-source MCP server with knowledge packs, 40+ browser tools, and YAML workflow engine for AI agents.

The AI Agent Stack — SideButton

SideButton is an AI agent platform with 40+ agent tools. It runs agentic workflows with knowledge packs and real browser control, and connects to Claude Code, Cursor, ChatGPT, or any other MCP client.

npx sidebutton@latest
# Dashboard at http://localhost:9876

What you get

MCP Server        40+ AI agent tools for browser control, workflow execution, and knowledge pack access. Stdio and SSE transports.
REST API          60+ endpoints. Trigger workflows remotely from webhooks, cron jobs, mobile apps, or other agents.
Workflow Engine   AI workflow automation with 34+ step types: browser, shell, LLM, control flow. Define agentic workflows in YAML.
Knowledge Packs   Installable domain knowledge: CSS selectors, data models, state machines. Role playbooks turn coding agents into an AI software engineer, QA, or PM.
Chrome Extension  40+ browser commands. Real DOM access via WebSocket, not screenshots. Recording mode.
Dashboard         Svelte UI: workflow browser, run logs, skill pack manager, system status.

Quick Start

# Install and start
npx sidebutton@latest

# Or from source
pnpm install && pnpm build && pnpm start

# Open http://localhost:9876

CLI

pnpm cli serve          # Start server with dashboard
pnpm cli serve --stdio  # Start with stdio transport (for Claude Desktop)
pnpm cli list           # List available workflows
pnpm cli status         # Check server status

# Skill pack management
pnpm cli registry add <path|url>   # Install skill packs from a registry
pnpm cli registry update [name]    # Update installed packs
pnpm cli registry remove <name>    # Uninstall packs and remove registry
pnpm cli search [query]            # Search available skill packs

# Creating skill packs
pnpm cli init [domain]             # Scaffold a new skill pack
pnpm cli validate [path]           # Validate pack structure
pnpm cli publish [source]          # Publish to a registry

MCP Server

SideButton is an AI agent platform and MCP server. AI coding agents connect to it directly for browser control, workflow automation, and domain knowledge.

Works with Claude Code, Cursor, Claude Desktop, VS Code, Windsurf, ChatGPT — any MCP client.

Claude Code

Add to ~/.claude/settings.json:

{
  "mcpServers": {
    "sidebutton": {
      "type": "sse",
      "url": "http://localhost:9876/mcp"
    }
  }
}

Claude Desktop

Add to ~/Library/Application Support/Claude/claude_desktop_config.json:

{
  "mcpServers": {
    "sidebutton": {
      "command": "npx",
      "args": ["sidebutton", "--stdio"]
    }
  }
}

Cursor

Add to ~/.cursor/mcp.json:

{
  "mcpServers": {
    "sidebutton": {
      "url": "http://localhost:9876/mcp"
    }
  }
}
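
If a client's config file already lists other MCP servers, pasting one of the snippets above over it would drop them. The following is a minimal Python sketch of a merge instead, assuming the file is valid JSON using the `mcpServers` layout shown above. The helper name `add_sidebutton_entry` is hypothetical and not part of SideButton.

```python
import json
from pathlib import Path


def add_sidebutton_entry(config_path: Path, url: str = "http://localhost:9876/mcp") -> dict:
    """Add or update the `sidebutton` entry, keeping everything else in the file."""
    # Start from the existing config if there is one, otherwise from an empty object.
    config = json.loads(config_path.read_text()) if config_path.exists() else {}
    # Only the `sidebutton` key is touched; other servers and settings survive.
    config.setdefault("mcpServers", {})["sidebutton"] = {"url": url}
    config_path.parent.mkdir(parents=True, exist_ok=True)
    config_path.write_text(json.dumps(config, indent=2) + "\n")
    return config
```

For Cursor, this could be called as `add_sidebutton_entry(Path.home() / ".cursor" / "mcp.json")`. Clients whose entry needs `"type": "sse"` or a `command` would need a different value than the bare `url` written here.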

MCP Tools

Tool                   Description
run_workflow           Execute a workflow by ID
list_workflows         List all available workflows
get_workflow           Get workflow YAML definition
get_run_log            Get execution log for a run
list_run_logs          List recent workflow executions
get_browser_status     Check browser extension connection
capture_page           Capture selectors from current page
navigate               Navigate browser to URL
snapshot               Get page accessibility snapshot
click                  Click an element
type                   Type text into an element
scroll                 Scroll the page
screenshot             Capture page screenshot
hover                  Hover over element
extract                Extract text from element
extract_all            Extract all matching elements
extract_map            Extract structured data from repeated elements
select_option          Select dropdown option
fill                   Fill input value (React-compatible)
press_key              Send keyboard keys
scroll_into_view       Scroll element into viewport
evaluate               Execute JavaScript in browser
exists                 Check if element exists
wait                   Wait for element or delay
check_writing_quality  Evaluate text quality
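
MCP clients invoke tools like these through JSON-RPC 2.0 `tools/call` requests. The Python sketch below builds one such request for `run_workflow`. The argument names `workflow_id` and `params` are assumptions made for illustration; the real input schema is whatever the server reports through `tools/list`.

```python
import json
from itertools import count

_ids = count(1)  # JSON-RPC request ids only need to be unique per connection


def tools_call(name: str, arguments: dict) -> dict:
    """Build an MCP `tools/call` request as a JSON-RPC 2.0 message."""
    return {
        "jsonrpc": "2.0",
        "id": next(_ids),
        "method": "tools/call",
        "params": {"name": name, "arguments": arguments},
    }


# Hypothetical argument names; check the tool's schema before relying on them.
request = tools_call(
    "run_workflow",
    {"workflow_id": "check_ticket", "params": {"ticket_id": "PROJ-123"}},
)
print(json.dumps(request, indent=2))
```

An MCP client library normally builds and sends this message for you; the sketch only shows what travels over the stdio or HTTP transport.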

REST API

60+ JSON endpoints for external integrations. The same workflows are available locally via MCP and remotely via REST.

# Run a workflow
curl -X POST http://localhost:9876/api/workflows/check_ticket/run \
  -H "Content-Type: application/json" \
  -d '{"params": {"ticket_id": "PROJ-123"}}'

# List workflows
curl http://localhost:9876/api/workflows

# Get run log
curl http://localhost:9876/api/runs/latest

Trigger workflows from webhooks, cron jobs, mobile apps, or other agents on different machines.
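
For callers that are not shell scripts, the first curl command above can be mirrored with Python's standard library. This is a sketch that only builds the request; the helper name `build_run_request` is hypothetical, and the host and port are the defaults from this README.

```python
import json
import urllib.request

BASE_URL = "http://localhost:9876"  # default address used throughout this README


def build_run_request(workflow_id: str, params: dict, base_url: str = BASE_URL) -> urllib.request.Request:
    """Build the POST request that starts a workflow run."""
    body = json.dumps({"params": params}).encode("utf-8")
    return urllib.request.Request(
        url=f"{base_url}/api/workflows/{workflow_id}/run",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


request = build_run_request("check_ticket", {"ticket_id": "PROJ-123"})
print(request.get_method(), request.full_url)
```

Passing the request to `urllib.request.urlopen(request, timeout=30)` would send it to a running server. The response format is not documented here, so the sketch stops before parsing it.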

Workflow Engine

YAML-first orchestration. 34+ step types:

Step Types

Type                    Description

Browser
browser.navigate        Open a URL
browser.click           Click an element by selector
browser.type            Type text into an element
browser.fill            Fill input value (React-compatible)
browser.scroll          Scroll the page
browser.extract         Extract text from element into variable
browser.extractAll      Extract all matching elements
browser.extractMap      Extract structured data from repeated elements
browser.wait            Wait for element or fixed delay
browser.exists          Check if element exists
browser.hover           Position cursor on element
browser.key             Send keyboard keys
browser.snapshot        Capture accessibility snapshot
browser.injectCSS       Inject CSS styles into page
browser.injectJS        Execute JavaScript in page
browser.select_option   Select dropdown option
browser.scrollIntoView  Scroll element into view

Shell
shell.run               Execute a bash command
terminal.open           Open a visible terminal window (macOS)
terminal.run            Run command in terminal window

LLM
llm.classify            Structured classification with categories
llm.generate            Free-form text generation

Control Flow
control.if              Conditional branching
control.retry           Retry with backoff
control.stop            End workflow with message
workflow.call           Call another workflow with parameters

Data
data.first              Extract first item from list

LLM steps work with Ollama (local), OpenAI, Anthropic, and Google.
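
The `control.retry` step is described as "retry with backoff". The general technique can be sketched in Python as below. This is not the engine's implementation: the parameter names, the doubling schedule, and the jitter are all assumptions chosen for illustration.

```python
import random
import time
from typing import Callable, TypeVar

T = TypeVar("T")


def retry_with_backoff(
    action: Callable[[], T],
    max_attempts: int = 3,
    base_delay: float = 0.5,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call `action` until it succeeds, doubling the wait after each failure."""
    for attempt in range(1, max_attempts + 1):
        try:
            return action()
        except Exception:
            if attempt == max_attempts:
                raise  # out of attempts: surface the last error
            delay = base_delay * 2 ** (attempt - 1)
            # A little random jitter keeps parallel retries from lining up.
            sleep(delay + random.uniform(0, delay / 10))
    raise ValueError("max_attempts must be at least 1")


calls = {"n": 0}


def flaky() -> str:
    """Fail twice, then succeed, like an element that appears late."""
    calls["n"] += 1
    if calls["n"] < 3:
        raise RuntimeError("element not found yet")
    return "ok"


print(retry_with_backoff(flaky, sleep=lambda seconds: None))  # prints: ok
```

Injecting `sleep` keeps the example fast to run and makes the wait schedule easy to inspect in tests.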

Example

id: check_ticket_status
title: "Check Jira ticket and classify"
steps:
  - type: browser.navigate
    url: "https://your-org.atlassian.net/browse/{{ticket_id}}"
  - type: browser.extract
    selector: "[data-testid='status-field']"
    as: current_status
  - type: control.if
    condition: "{{current_status}} != 'Done'"
    then:
      - type: llm.classify
        prompt: "Should this ticket be closed? Context: {{current_status}}"
        classes: [close, keep_open]
        as: decision
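
To make the control flow concrete, here is a toy Python interpreter for the workflow above. It is a sketch, not code from @sidebutton/core: the workflow is written as a Python dict mirroring the YAML, the browser and LLM steps are stubs that return canned values, and the condition evaluator only understands the `a != b` and `a == b` forms. All function names are hypothetical.

```python
import re


def interpolate(text: str, ctx: dict) -> str:
    """Replace each {{name}} with its value from the context."""
    return re.sub(r"\{\{\s*(\w+)\s*\}\}", lambda m: str(ctx[m.group(1)]), text)


def condition_holds(expr: str, ctx: dict) -> bool:
    """Evaluate '<left> != <right>' or '<left> == <right>' after interpolation."""
    match = re.fullmatch(r"(.*?)\s*(!=|==)\s*(.*)", interpolate(expr, ctx))
    if match is None:
        raise ValueError(f"unsupported condition: {expr}")
    left, op, right = match.groups()
    same = left.strip("'\" ") == right.strip("'\" ")
    return same if op == "==" else not same


def run_steps(steps: list, ctx: dict, handlers: dict) -> dict:
    """Run steps in order; a step's `as` key names the variable that stores its result."""
    for step in steps:
        if step["type"] == "control.if":
            if condition_holds(step["condition"], ctx):
                run_steps(step.get("then", []), ctx, handlers)
            continue
        result = handlers[step["type"]](step, ctx)
        if "as" in step:
            ctx[step["as"]] = result
    return ctx


# Stub handlers standing in for the real browser and LLM integrations.
handlers = {
    "browser.navigate": lambda step, ctx: None,
    "browser.extract": lambda step, ctx: "In Progress",
    "llm.classify": lambda step, ctx: step["classes"][0],
}

workflow = {
    "id": "check_ticket_status",
    "steps": [
        {"type": "browser.navigate", "url": "https://your-org.atlassian.net/browse/{{ticket_id}}"},
        {"type": "browser.extract", "selector": "[data-testid='status-field']", "as": "current_status"},
        {"type": "control.if", "condition": "{{current_status}} != 'Done'", "then": [
            {"type": "llm.classify", "prompt": "Should this ticket be closed? Context: {{current_status}}",
             "classes": ["close", "keep_open"], "as": "decision"},
        ]},
    ],
}

final = run_steps(workflow["steps"], {"ticket_id": "PROJ-123"}, handlers)
print(final["current_status"], final["decision"])  # prints: In Progress close
```

With the stubbed status "In Progress" the condition holds, so the nested `llm.classify` step runs and stores its result as `decision`. If the extract stub returned "Done", the `then` branch would be skipped and `decision` would never be set.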

Variable Interpolation

Use {{variable}} syntax to reference extracted values or parameters:

steps:
  - type: browser.extract
    selector: ".username"
    as: user
  - type: shell.run
    cmd: "echo 'Hello, {{user}}!'"
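
Because a `{{variable}}` can only be filled from a parameter or from an earlier step's `as` output, a pre-run check for undefined references is straightforward to sketch. The Python below is an illustration under assumptions, not a SideButton feature: it treats steps as plain dicts, looks only at top-level string fields (nested `then` blocks are ignored), and tolerates whitespace inside the braces.

```python
import re

VAR = re.compile(r"\{\{\s*(\w+)\s*\}\}")


def undefined_references(steps: list, params: set) -> list:
    """Return (step index, variable) pairs that are used before being defined."""
    defined = set(params)
    problems = []
    for index, step in enumerate(steps):
        for value in step.values():
            if isinstance(value, str):
                for name in VAR.findall(value):
                    if name not in defined:
                        problems.append((index, name))
        # A step's output becomes available only to the steps after it.
        if "as" in step:
            defined.add(step["as"])
    return problems


steps = [
    {"type": "browser.extract", "selector": ".username", "as": "user"},
    {"type": "shell.run", "cmd": "echo 'Hello, {{user}}!'"},
    {"type": "shell.run", "cmd": "echo '{{greeting}}'"},
]
print(undefined_references(steps, params=set()))  # [(2, 'greeting')]
```

The first two steps match the example above and pass the check; the third is added here to show a reference that nothing defines.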

Knowledge Packs

Knowledge packs are installable domain knowledge for a web app or domain; the code and CLI commands call them skill packs. They power AI code review, automated testing, and enterprise AI agent deployments. A pack can contain:

  • Selectors — CSS selectors for UI elements
  • Data models — entity types, fields, relationships, valid states
  • State machines — valid transitions per state
  • Role playbooks — role-specific procedures (QA, SE, PM, SD)
  • Common tasks — step-by-step procedures, gotchas, edge cases

Install packs by domain:

sidebutton install github.com
sidebutton install atlassian.net

11 domains, 28+ modules published. Open registry — build and share packs for any web app.

Chrome Extension

Install from the Chrome Web Store.

  • 40+ browser commands — navigate, click, type, extract, scroll, wait, snapshot
  • Real DOM access via CSS selectors — not pixel coordinates, not screenshots
  • Recording mode — capture manual actions as workflows
  • Embed buttons — inject action buttons into any web page
  • WebSocket connection — stable reconnection, works with local or remote server

After installing:

  1. Navigate to any website
  2. Click the SideButton extension icon
  3. Click "Connect This Tab"

Dashboard & Observability

Svelte UI at http://localhost:9876:

  • Workflow browser — list, search, run
  • Run logs — step-by-step execution traces with timing, variables, errors
  • Skill pack manager — install, browse, inspect
  • System status — extension connection, LLM config, server health

SideButton handles AI agent orchestration — from workflow execution to knowledge injection.

Architecture

┌──────────────────────────────────────────────────────────────────────────┐
│                          @sidebutton/server                               │
│                                                                          │
│  ┌─────────────────────┐   ┌──────────────────────────────────────────┐  │
│  │  stdio Transport    │   │    Fastify HTTP + WebSocket (port 9876)  │  │
│  │  ─────────────────  │   │    ────────────────────────────────────  │  │
│  │  stdin → JSON-RPC   │   │  GET  /        → Dashboard (Svelte)      │  │
│  │  stdout ← JSON-RPC  │   │  GET  /ws      → Chrome Extension WS     │  │
│  │  (Claude Desktop)   │   │  POST /mcp     → MCP JSON-RPC (SSE)      │  │
│  └──────────┬──────────┘   │  GET  /api/*   → REST API                │  │
│             │              └──────────────────────┬───────────────────┘  │
│             │                                     │                      │
│             └──────────────────┬──────────────────┘                      │
│                                ▼                                         │
│  ┌────────────────────────────────────────────────────────────────────┐  │
│  │                       @sidebutton/core                              │  │
│  │                                                                     │  │
│  │  - Workflow types & parser (YAML)                                  │  │
│  │  - Step executors (37 step types)                                  │  │
│  │  - Variable interpolation                                          │  │
│  │  - Execution context & events                                      │  │
│  └────────────────────────────────────────────────────────────────────┘  │
└──────────────────────────────────────────────────────────────────────────┘
      ▲                ▲                      ▲                      ▲
      │ stdio          │ WebSocket            │ HTTP POST            │ REST
      ▼                ▼                      ▼                      ▼
┌──────────────┐ ┌─────────────────┐ ┌─────────────────┐ ┌───────────────────┐
│Claude Desktop│ │ Chrome Extension│ │   Claude Code   │ │   Mobile App      │
│ (MCP stdio)  │ │ (Browser Auto)  │ │   (MCP SSE)     │ │   (REST Client)   │
└──────────────┘ └─────────────────┘ └─────────────────┘ └───────────────────┘

Project Structure

sidebutton/
├── packages/
│   ├── core/              # @sidebutton/core — workflow engine
│   │   └── src/
│   │       ├── types.ts       # Workflow types
│   │       ├── parser.ts      # YAML loader
│   │       ├── executor.ts    # Workflow runner
│   │       └── steps/         # Step implementations
│   ├── server/            # @sidebutton/server — MCP + HTTP + CLI
│   │   ├── bin/               # CLI entry point
│   │   └── src/
│   │       ├── server.ts      # Fastify HTTP server
│   │       ├── stdio-mode.ts  # stdio transport entry point
│   │       ├── extension.ts   # WebSocket client
│   │       ├── mcp/           # MCP handlers
│   │       │   ├── handler.ts    # MCP JSON-RPC logic
│   │       │   ├── stdio.ts      # stdio transport adapter
│   │       │   └── tools.ts      # Tool definitions
│   │       └── cli.ts         # Commander CLI
│   └── dashboard/         # Svelte web UI
│       └── src/
│           ├── App.svelte
│           └── lib/
├── extension/             # Chrome extension
├── workflows/             # Public workflow library
├── actions/               # User-created workflows
├── skills/                # Installed skill packs
└── run_logs/              # Execution history

Environment Variables

Variable           Required For  Description
OPENAI_API_KEY     llm.* steps   OpenAI API key for LLM workflows
ANTHROPIC_API_KEY  llm.* steps   Anthropic API key (alternative)

Development

pnpm install       # Install dependencies
pnpm build         # Build all packages
pnpm start         # Start server
pnpm cli list      # List workflows
pnpm cli status    # Check status

Watch Mode

pnpm dev              # Full dev mode (all packages)
pnpm dev:server       # Server with auto-restart on :9876
pnpm dev:dashboard    # Dashboard watch build
pnpm dev:core         # Core library watch build

Platform Automation Disclaimer

SideButton is a general-purpose browser automation framework. When automating third-party platforms:

  • Review Terms of Service: Many platforms prohibit or restrict automation. You are responsible for complying with the terms of any platform you automate.
  • Account Risk: Automation may result in account restrictions or suspension on some platforms.
  • Use Responsibly: Only automate actions you would perform manually. Respect rate limits and platform guidelines.

The authors do not endorse or encourage violations of third-party terms of service.

Legal

License

This project uses mixed licensing. See LICENSING.md for details.
