Trustwise

Advanced evaluation tools for AI safety, alignment, and performance using the Trustwise API.

🦉 Trustwise MCP Server

The Trustwise MCP Server is a Model Context Protocol (MCP) server that provides a suite of advanced evaluation tools for AI safety, alignment, and performance. It enables developers and AI tools to programmatically assess the quality, safety, and cost of LLM outputs using Trustwise's industry-leading metrics.

💡 Use Cases

Evaluating the safety and reliability of LLM responses.
Measuring alignment, clarity, and helpfulness of AI-generated content.
Estimating the carbon footprint and cost of model inference.
Integrating robust evaluation into AI pipelines, agents, or orchestration frameworks.

🛠️ Prerequisites

A Trustwise API Key (get one here)
Docker; Follow the install instructions

📦 Installation & Running

Claude Desktop

To connect the Trustwise MCP Server to Claude Desktop, add the following configuration to your Claude Desktop settings:

{
  "mcpServers": {
    "trustwise": {
      "command": "docker",
      "args": [
        "run",
        "-i",
        "--rm",
        "-e",
        "TW_API_KEY",
        "ghcr.io/trustwiseai/trustwise-mcp-server:latest"
      ],
      "env": {
        "TW_API_KEY": "<YOUR_TRUSTWISE_API_KEY>"
      }
    }
  }
}

To point to a specific Trustwise Instance - under env, also set the following optional environment variable:

TW_BASE_URL: "<YOUR_TRUSTWISE_INSTANCE_URL>"

e.g "TW_BASE_URL": "https://api.yourdomain.ai"

Cursor

To connect the Trustwise MCP Server to cursor, add the following configuration to your cursor settings:

{
  "mcpServers": {
    "trustwise": {
      "command": "docker",
      "args": [
        "run",
        "-i",
        "--rm",
        "-e",
        "TW_API_KEY",
        "-e",
        "TW_BASE_URL",
        "ghcr.io/trustwiseai/trustwise-mcp-server:latest"
      ],
      "env": {
        "TW_API_KEY": "<YOUR_TRUSTWISE_API_KEY>"
      }
    }
  }
}

Replace <YOUR_TRUSTWISE_API_KEY> with your actual Trustwise API key.

🧰 Tools

The Trustwise MCP Server exposes the following tools (metrics). Each tool can be called with the specified arguments to evaluate a model response.

🛡️ Trustwise Metrics

Tool Name	Description
`faithfulness_metric`	Evaluate the faithfulness of a response to its context
`answer_relevancy_metric`	Evaluate relevancy of a response to the query
`context_relevancy_metric`	Evaluate relevancy of context to the query
`pii_metric`	Detect PII in a response
`prompt_injection_metric`	Detect prompt injection risk
`summarization_metric`	Evaluate summarization quality
`clarity_metric`	Evaluate clarity of a response
`formality_metric`	Evaluate formality of a response
`helpfulness_metric`	Evaluate helpfulness of a response
`sensitivity_metric`	Evaluate sensitivity of a response
`simplicity_metric`	Evaluate simplicity of a response
`tone_metric`	Evaluate tone of a response
`toxicity_metric`	Evaluate toxicity of a response
`refusal_metric`	Detect refusal to answer or comply with the query
`completion_metric`	Evaluate completion of the query’s instruction
`adherence_metric`	Evaluate adherence to a given policy or instruction
`stability_metric`	Evaluate stability (consistency) of multiple responses
`carbon_metric`	Estimate carbon footprint of a response
`cost_metric`	Estimate cost of a response

For more examples and advanced usage, see the official Trustwise SDK.

📄 License

This project is licensed under the terms of the MIT open source license. See LICENSE for details.

🔒 Security

Do not commit secrets or API keys.
This repository is public; review all code and documentation for sensitive information before pushing.

Serveurs connexes

Alpha Vantage MCP Server

sponsor

Access financial market data: realtime & historical stock, ETF, options, forex, crypto, commodities, fundamentals, technical indicators, & more

DevContext

Provides developers with continuous, project-centric context awareness. Requires a TursoDB database.

Figma Copilot

Enables AI assistants to interact with and automate Figma designs programmatically.

SeaLights

An MCP server for interacting with the SeaLights platform for quality intelligence.

MCP-Logic

Provides automated reasoning for AI systems using the Prover9 and Mace4 theorem provers.

Math MCP Learning

Educational MCP server with math operations, statistics, visualizations, and persistent workspace.

MCP Repo Search Server

MCP server that gives LLMs structural code intelligence across multiple repos

MCP RAG Server

A Python server providing Retrieval-Augmented Generation (RAG) functionality. It indexes various document formats and requires a PostgreSQL database with pgvector.

MCP Sequence Simulation Server

Simulate DNA and amino acid sequences using evolutionary models and algorithms.

Tencent Cloud Code Analysis

An official MCP server for Tencent Cloud Code Analysis (TCA) to quickly start code analysis and obtain reports.

Interactive Feedback MCP

An MCP server for interactive user feedback and command execution in AI-assisted development.