Scorecard
Access Scorecard's AI model evaluation and testing tools via a Cloudflare Workers deployment.
Scorecard MCP Server on Cloudflare
This repository allows you to deploy a remote MCP server on Cloudflare Workers that enables Claude and other MCP clients to access Scorecard's evaluation tools.
Get started: deploy this repository to Cloudflare Workers.
The deployment serves your Scorecard MCP server at a URL like: scorecard-mcp.<your-account>.workers.dev/sse
Alternatively, you can clone this repository and deploy it using Wrangler:
git clone https://github.com/scorecard-ai/scorecard-mcp.git
cd scorecard-mcp
npm install
npm run deploy
About This MCP Server
This MCP server provides access to Scorecard's evaluation tools directly from Claude and other MCP-compatible clients. It uses Clerk for authentication and is built on Cloudflare Workers for reliable, global deployment.
The server implements the MCP specification (2025-03-26) and provides secure access to Scorecard's API for running experiments, generating synthetic data, configuring metrics, and analyzing model performance.
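To make the protocol version concrete, here is a minimal sketch of the JSON-RPC "initialize" request that an MCP client sends when it first connects. MCP clients such as Claude build this message for you, so it is shown only for orientation. The helper name buildInitializeRequest and the client name "example-client" are hypothetical and are not part of this repository.

```typescript
// Shape of the MCP "initialize" request (JSON-RPC 2.0).
interface InitializeRequest {
  jsonrpc: "2.0";
  id: number;
  method: "initialize";
  params: {
    protocolVersion: string;
    capabilities: Record<string, unknown>;
    clientInfo: { name: string; version: string };
  };
}

// Spec revision named in this README.
const MCP_PROTOCOL_VERSION = "2025-03-26";

// Hypothetical helper: builds the first message a client sends to the server.
function buildInitializeRequest(
  id: number,
  clientName: string,
  clientVersion: string
): InitializeRequest {
  return {
    jsonrpc: "2.0",
    id,
    method: "initialize",
    params: {
      protocolVersion: MCP_PROTOCOL_VERSION,
      capabilities: {}, // an empty object advertises no optional client features
      clientInfo: { name: clientName, version: clientVersion },
    },
  };
}

// Illustrative values only.
const initializeRequest = buildInitializeRequest(1, "example-client", "0.1.0");
```

The server replies with the protocol version it will use, so a client can detect a version mismatch before calling any Scorecard tools.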
Connect to MCP Clients
This MCP server works with various MCP-compatible clients:
Connect to claude.ai, Cursor, and Windsurf
Once deployed, you can connect to your MCP server from Claude and other MCP-compatible clients by providing your server URL:
https://scorecard-mcp.<your-account>.workers.dev/sse
Connect via Cloudflare AI Playground
You can also connect through the Cloudflare AI Playground:
- Go to https://playground.ai.cloudflare.com/
- Enter your deployed MCP server URL (scorecard-mcp.<your-account>.workers.dev/sse)
- You can now use Scorecard's evaluation tools directly from the playground!
Connect via Claude Desktop
For local testing, you can connect to your MCP server from Claude Desktop by using the mcp-remote proxy.
Follow Anthropic's Quickstart and within Claude Desktop go to Settings > Developer > Edit Config.
Update with this configuration. For local testing, replace the URL with http://localhost:8787/sse. The file must be plain JSON, so do not add comments to it:
{
  "mcpServers": {
    "scorecard": {
      "command": "npx",
      "args": [
        "mcp-remote",
        "https://scorecard-mcp.<your-account>.workers.dev/sse"
      ]
    }
  }
}
Restart Claude Desktop and you should see the tools become available.
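If you switch between the deployed server and a local one often, the configuration above can be generated rather than edited by hand. The following is a small sketch, not part of this repository: the function names buildClaudeDesktopConfig and renderConfig are hypothetical, and the output mirrors the structure shown above.

```typescript
// One entry under "mcpServers" in the Claude Desktop config.
interface McpServerEntry {
  command: string;
  args: string[];
}

interface ClaudeDesktopConfig {
  mcpServers: Record<string, McpServerEntry>;
}

// Hypothetical helper: points the mcp-remote proxy at the given server URL.
function buildClaudeDesktopConfig(serverUrl: string): ClaudeDesktopConfig {
  return {
    mcpServers: {
      scorecard: { command: "npx", args: ["mcp-remote", serverUrl] },
    },
  };
}

// Serializes to strict JSON (no comments), ready to paste into the config file.
function renderConfig(serverUrl: string): string {
  return JSON.stringify(buildClaudeDesktopConfig(serverUrl), null, 2);
}

// Local development variant, using the URL mentioned in this README.
const localConfigJson = renderConfig("http://localhost:8787/sse");
```

Because the output comes from JSON.stringify, it cannot contain comments or trailing commas, both of which are invalid in JSON.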
Local Development
For local development, create a ".dev.vars" file with your Clerk credentials:
cp .dev.vars.example .dev.vars
Configure the following variables in your .dev.vars file:
| Variable | Source | Notes |
|---|---|---|
| CLERK_CLIENT_ID | Clerk Dashboard -> Configure -> OAuth Applications | |
| CLERK_CLIENT_SECRET | Clerk Dashboard -> Configure -> OAuth Applications | Cannot be viewed after initial generation |
| CLERK_DOMAIN | Clerk Dashboard -> Configure -> API Keys -> Frontend API URL | Override this with the Clerk development URL if using with local Scorecard server |
| CLERK_PUBLISHABLE_KEY | Clerk Dashboard -> Configure -> API Keys -> Publishable Key | Override this with the pk_test_* one if using with local Scorecard server |
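A quick way to catch a missing credential before starting the server is to check the file contents against the four variable names in the table. The sketch below assumes .dev.vars uses simple KEY=VALUE lines (dotenv style); parseDevVars and missingDevVars are hypothetical helpers, not part of this repository, and they take the file text as a string so the example stays independent of any file-system API.

```typescript
// Variable names taken from the table above.
const REQUIRED_VARS = [
  "CLERK_CLIENT_ID",
  "CLERK_CLIENT_SECRET",
  "CLERK_DOMAIN",
  "CLERK_PUBLISHABLE_KEY",
] as const;

// Parses KEY=VALUE lines, skipping blank lines and "#" comments.
function parseDevVars(text: string): Map<string, string> {
  const vars = new Map<string, string>();
  for (const rawLine of text.split(/\r?\n/)) {
    const line = rawLine.trim();
    if (line === "" || line.startsWith("#")) continue;
    const eq = line.indexOf("=");
    if (eq <= 0) continue; // no key before "=", or no "=" at all
    const key = line.slice(0, eq).trim();
    // Strip one surrounding quote character from each end, if present.
    const value = line.slice(eq + 1).trim().replace(/^["']|["']$/g, "");
    vars.set(key, value);
  }
  return vars;
}

// Returns the required names that are absent or set to an empty value.
function missingDevVars(text: string): string[] {
  const vars = parseDevVars(text);
  return REQUIRED_VARS.filter((name) => !vars.get(name));
}

// Illustrative input with placeholder values, not real credentials.
const sampleDevVars = [
  "CLERK_CLIENT_ID=abc",
  "# secret is shown only once in the Clerk Dashboard",
  'CLERK_CLIENT_SECRET="s3cret"',
  "CLERK_DOMAIN=",
].join("\n");

const missing = missingDevVars(sampleDevVars);
```

For the sample input, missing lists CLERK_DOMAIN (present but empty) and CLERK_PUBLISHABLE_KEY (absent).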
Then run the development server:
npm install
npm run dev
Remember to run npx wrangler types to generate types for the environment variables.
Contributors
Special thanks to Dustin Moore for his engineering leadership in developing this MCP implementation.