MCP Bench Router
Claude Code sucks at design. Let it delegate it's tasks to better models. Claude will use the MCP to get leaderboard of best design models and query specific code changes using OpenRouter.
mcp-bench-router
MCP server that delegates design tasks to whichever model currently tops designarena.ai's crowdsourced leaderboard, via OpenRouter.
Why?
AI coding assistants produce mediocre frontend/design output. designarena.ai maintains a live leaderboard of the best design models, updated every 2 hours. This MCP server lets Claude (or any MCP client) automatically route design tasks to the current best model.
Tools
| Tool | Description |
|---|---|
get_best_design_model | Get the current #1 design model, optionally by category |
get_leaderboard | Browse full rankings with category filter and pagination |
query_design_model | Send a prompt to the best available model via OpenRouter |
query_specific_model | Send a prompt to a specific model via OpenRouter |
Setup
Prerequisites
- Node.js >= 18
- An OpenRouter API key (required for
query_design_modelandquery_specific_model)
Claude Code (CLI)
claude mcp add mcp-bench-router -s user -e OPENROUTER_API_KEY=sk-or-... -- npx -y mcp-bench-router
Claude Desktop
Add to ~/Library/Application Support/Claude/claude_desktop_config.json (macOS) or %APPDATA%\Claude\claude_desktop_config.json (Windows):
{
"mcpServers": {
"mcp-bench-router": {
"command": "npx",
"args": ["-y", "mcp-bench-router"],
"env": {
"OPENROUTER_API_KEY": "sk-or-..."
}
}
}
}
Cursor
Add to .cursor/mcp.json in your project root:
{
"mcpServers": {
"mcp-bench-router": {
"command": "npx",
"args": ["-y", "mcp-bench-router"],
"env": {
"OPENROUTER_API_KEY": "sk-or-..."
}
}
}
}
Other MCP clients
OPENROUTER_API_KEY=sk-or-... npx -y mcp-bench-router
From source
git clone https://github.com/K1ta141k/mcp-bench-router.git
cd mcp-bench-router
npm install
npm run build
Categories
Filter by design category: allcategories, website, gamedev, 3d, dataviz, uicomponent, image, logo, svg, video, imagetoimage, slides, graphicdesign, tts.
Usage Examples
Check who's #1:
"Use get_best_design_model to see the current top design model"
Browse rankings:
"Show me the top 20 design models for websites"
Generate a design:
"Use query_design_model to create a landing page for a SaaS product"
Use a specific model:
"Use query_specific_model with claude-sonnet-4-5 to design a dashboard"
How It Works
- Fetches live rankings from designarena.ai's API (cached 2hr)
- Maps arena model names to OpenRouter IDs (static + dynamic + fuzzy matching)
- Skips codename/unreleased models and picks the best available one
- Routes the prompt through OpenRouter's API
License
MIT
İlgili Sunucular
Scout Monitoring MCP
sponsorPut performance and error data directly in the hands of your AI assistant.
Alpha Vantage MCP Server
sponsorAccess financial market data: realtime & historical stock, ETF, options, forex, crypto, commodities, fundamentals, technical indicators, & more
USolver
A server for solving combinatorial, convex, integer, and non-linear optimization problems.
TokenTool MCP
Deploy and manage compliant tokens from Claude, Cursor, or any AI agent — by typing a sentence.
WinAiDbg MCP
AI-powered Windows crash dump analysis platform that provides structured access to Microsoft debugging tools through the Model Context Protocol, making complex crash investigation accessible to AI systems
302AI Custom MCP Server
A customizable MCP service with flexible tool selection and configuration. Requires a 302AI API key.
D2 MCP Server
Generate, render, and manipulate D2 diagrams with incremental editing capabilities.
NMAP
Perform network scanning and security auditing using the NMAP utility.
Python Local
An interactive Python REPL environment with persistent session history.
Change8
Breaking Change Alerts for Humans and AI Agents.
Lean LSP
Interact with the Lean theorem prover via the Language Server Protocol (LSP), enabling LLM agents to understand, analyze, and modify Lean projects.
Divvi MCP Server
Automatically integrate the Divvi referral SDK into JavaScript and TypeScript blockchain applications.