Agy Bridge MCP Server

MCP bridge that lets Claude Code delegate heavy tasks to the Antigravity CLI (agy) — purpose-built tools, model routing with fallback, session continuity, and output truncation to save Claude's context and tokens.

Documentation

agy-bridge

CI npm version npm downloads node license

An MCP bridge that lets Claude Code delegate heavy tasks to the Antigravity CLI (agy) — saving Claude's context window and tokens for what matters.

Claude sends a task → the bridge routes it to the best available model via agy → only the answer comes back. Large files, deep git searches, and web lookups never touch Claude's context.

User → Claude Code → agy-bridge (MCP) → agy CLI → Gemini / Claude / GPT-OSS
                   ←                  ←          ←

Why this over claude-to-agy?

claude-to-agyagy-bridge
Tool surface1 generic delegate_to_agy6 purpose-built tools — Claude self-routes reliably
Model selectionnone (agy default only)per-tool routing across all agy models, with availability detection and fallback
Multi-turnstatelesssession continuity — follow_up resumes agy conversations without resending context
Output safetyunboundedconfigurable truncation cap protects Claude's context
Sandboxnooptional --sandbox mode
Installuvx (Python)npx (Node) — zero install

Requirements

Install

# 1. Register the MCP server (user scope = all projects)
claude mcp add -s user agy-bridge npx -- -y agy-bridge

# 2. Add delegation rules to your project (or ~/.claude/CLAUDE.md for global)
curl -o CLAUDE.md https://raw.githubusercontent.com/sshahzaiib/agy-bridge/main/CLAUDE.md

Tools

ToolUse forModel routing (first available)
analyze_filesFiles >200 lines, >3 files at once, logs, dumps, generated codeGemini 3.5 Flash (High) → Gemini 3.1 Pro (Low)
deep_searchgit log/diff/blame archaeology, repo-wide grepsGemini 3.5 Flash (Medium) → (High)
web_lookupDocs, API references, external/current knowledgeGemini 3.5 Flash (Medium) → (High)
adversarial_reviewPlan critiques, design and code reviewsGemini 3.1 Pro (High) → Claude Opus 4.6 (Thinking) → Flash (High)
follow_upContinue a prior session by session_id — no context resendinherits the session
delegateAnything else heavyGemini 3.5 Flash (High)

All tools accept optional cwd (project root) and model (exact name from agy models; validated, with available models listed on mismatch).

Every response ends with a footer:

---
[agy-bridge] model: Gemini 3.5 Flash (High) | session: 1f0c…-d4 (use follow_up to continue)

Model routing

On first use the bridge runs agy models (cached for the process lifetime) and picks the first available model in the tool's preference chain. If none is available it falls back to AGY_DEFAULT_MODEL, and finally to agy's own default. agy silently ignores unknown --model values, so the bridge validates names up front instead of letting requests land on the wrong model.

Quota-aware failover

agy never surfaces quota exhaustion in print mode — it silently retries the 429 until its print-timeout, then exits 0 with empty output, which used to look like an indefinite hang. The bridge now watches each run's log file (via --log-file) and on RESOURCE_EXHAUSTED (code 429):

  1. kills the agy process group immediately (no waiting out the timeout),
  2. parses the reset time ("Resets in 4h24m") into an in-process cooldown registry,
  3. retries the same prompt on the next model in the tool's chain,
  4. skips cooled-down models on all subsequent calls until their quota resets.

Failovers are annotated in the response footer (failover: <model>: quota exhausted (resets in 4h24m)). Only when every candidate is exhausted does the call fail — in seconds, with reset times listed — instead of hanging.

Timeouts and cancellation

Each tool has its own default timeout sized to its job: web_lookup 120s, deep_search 180s, analyze_files / adversarial_review / follow_up 300s, delegate 600s. Setting AGY_TIMEOUT explicitly overrides all of them. The kill path escalates SIGTERM → SIGKILL across the whole process group, and the deadline fires even if agy's helper processes hold the output pipes open. Cancelling the tool call from the MCP client (e.g. pressing Esc in Claude Code) also kills the agy run instead of orphaning it.

Configuration

All optional, via environment variables:

VariableDefaultDescription
AGY_PATHagyPath to the agy binary
AGY_TIMEOUTper-toolSeconds; overrides the per-tool timeouts (see above), passed as --print-timeout, enforced with a 15s kill grace
AGY_MAX_OUTPUT_CHARS50000Truncation cap for tool output
AGY_DEFAULT_MODELunsetFallback model when no chain entry is available
AGY_SKIP_PERMISSIONStruePass --dangerously-skip-permissions to agy
AGY_SANDBOXfalseRun agy with --sandbox
AGY_ON_FAILUREfallbackstrict appends an instruction to failed-tool errors telling the calling agent not to absorb the work itself

Failure behavior

The bridge always fails loudly: agy errors surface as MCP tool errors with agy's actual stderr, and degraded model routing is annotated in the response footer. By default the calling agent (Claude) will typically do the work itself after a failure — visible in the transcript, but easy to stop noticing in a long session. Set AGY_ON_FAILURE=strict to append an explicit "do NOT perform this work yourself — report the failure to the user" instruction to every delegation error, so you keep control over when token savings are silently lost.

Development

npm install
npm test           # vitest unit tests (exec mocked — no agy needed)
npm run typecheck
npm run build      # tsup → dist/index.js

Contributors

Contributions are welcome — open an issue or PR.

Contributors

Star History

Star History Chart

License

MIT