XAIP MCP Server

为AI代理在XRPL上提供持久的链上身份——DID、凭证、声誉评分、托管和记忆链。

文档

XAIP — Signed Execution Receipts for AI Agent Tool Calls

Evidence before delegation. Wrap an agent's tool calls once; use the receipt history locally today, and share the same signed receipts later as portable, independently verifiable evidence.

XAIP is a provider-neutral signed execution evidence layer for AI agent tool calls. It records co-signed receipts — both the executing agent and the caller sign the same canonical record, so neither side can unilaterally fabricate one — across MCP, LangChain.js, OpenAI-compatible tool-call loops, and other runtimes, then exposes historical execution evidence that agents, developers, or policy layers can inspect before delegation.

Receipts are the primary artifact. Trust scores are one derived view over those receipts — not a claim of absolute safety or correctness.

Trust Evidence Before Delegation — XAIP demo

Live demo: three candidate servers, side-by-side comparison without and with XAIP. Open in browser →

Provider-agnostic by design. XAIP is a trust layer for any tool-using agent. The reference implementation and live data start with MCP (Model Context Protocol) — because that's where the broadest fleet of public tool servers exists today — but the receipt format, signing, and scoring apply equally to LangChain tools, OpenAI function calling, A2A, and proprietary agent stacks. MCP is the first integration, not the only one.

Live dashboard: https://xkumakichi.github.io/xaip-protocol/ — current public trust scores, auto-refreshed, no auth. The current public dataset is MCP-heavy because MCP was the first integration target.

Entry points

Try It Now

The API is live. No signup, no API key.

# Check trust score for a scored tool server
curl https://xaip-trust-api.kuma-github.workers.dev/v1/trust/context7

# Batch query
curl "https://xaip-trust-api.kuma-github.workers.dev/v1/trust?slugs=context7,sequential-thinking,filesystem"

# Decision engine: rank candidates by available execution evidence
curl -X POST https://xaip-trust-api.kuma-github.workers.dev/v1/select \
  -H "Content-Type: application/json" \
  -d '{"task":"Fetch React docs","candidates":["context7","sequential-thinking","unknown-server"]}'

The /v1/select response tells you which server to use, why, and what would happen without XAIP:

{
  "selected": "context7",
  "reason": "Highest trust among scored candidates based on current verified receipts",
  "rejected": [{ "slug": "unknown-server", "reason": "unscored — no execution evidence available" }],
  "withoutXAIP": "Random selection would pick an unscored server 33% of the time — no execution evidence available"
}

The Problem

Without trust scores, your agent is gambling:

┌────────────────┬────────────────┬───────────┬──────────────┐
│ Strategy       │ Server Hit     │ Success   │ Latency      │
├────────────────┼────────────────┼───────────┼──────────────┤
│ With XAIP      │ context7       │ ✓         │ ~3s          │
│ Random         │ unknown-mcp    │ ✗ error   │ ~8s (wasted) │
│ Try all (seq)  │ 3 servers      │ 1/3       │ ~11s total   │
└────────────────┴────────────────┴───────────┴──────────────┘

XAIP helps agents prefer candidates with stronger available execution evidence, skip unscored candidates when appropriate, and reduce avoidable failed calls.

How It Works

1. Select    POST /v1/select → ranks candidates by available execution evidence
2. Execute   Your agent calls the selected tool server
3. Report    POST /receipts → signed execution receipt feeds back into trust scores

Every execution receipt is Ed25519-signed and verified. Trust scores are computed using a Bayesian model with caller diversity weighting — not self-reported metrics.

Quick Start

Run the end-to-end demo

git clone https://github.com/xkumakichi/xaip-protocol.git
cd xaip-protocol/demo
npm install
npx tsx dogfood.ts

This demo:

  1. Asks XAIP to rank candidate servers for "Fetch React hooks documentation" by available execution evidence
  2. Connects to the selected MCP server and executes real tool calls
  3. Submits a signed execution receipt to the Aggregator
  4. Shows the updated trust score

Decision quality demo

Compare blind selection strategies against XAIP-guided selection using a static trust snapshot and fixed candidate sets:

cd demo
npm run blind-vs-xaip

This is a deterministic local replay. It does not perform live tool execution, post receipts, or call any external API. See docs/blind-vs-xaip-demo.md for scope, metrics, and limitations.

In the included snapshot replay:

StrategyRisky pick rateEligible pick rate
Random71.4%28.6%
Fixed-order85.7%14.3%
XAIP14.3%85.7%

risky_pick = selected candidate was low_trust or unscored in the snapshot. fixed-order models an agent that accepts the upstream planner's candidate order without runtime trust data. The claim is limited to this fixed candidate set and static trust snapshot — not a guarantee of real-world execution improvement.

Become an independent caller

Want the trust graph to depend on more than one operator? Run a caller yourself. No account, no approval, no API key — the aggregator verifies signatures from any valid keypair.

Fastest — zero-install, 30 seconds:

npx xaip-caller

Signs receipts for a handful of real HTTP tool calls and POSTs them. Demonstrates that XAIP works beyond MCP — any HTTP tool can participate. See clients/caller. See Run xaip-caller for Windows notes and external receipt contribution details.

Full path — MCP servers, 5 minutes:

Clone the repo and run the auto-collector against real MCP servers. Your caller DID contributes to the diversity of every scored MCP tool. See docs/contributor/run-a-caller.md.

Use the SDK

npm install xaip-sdk
import { precheck } from "xaip-sdk";

const result = await precheck({
  task: "Fetch React documentation",
  candidates: ["context7", "memory", "unknown-server"],
  includeDecision: true,
});

console.log(result.selected); // e.g. "memory" or null
console.log(result.decision); // "allow", "warn", or "unknown"

precheck() is a thin SDK wrapper over POST /v1/select. It returns available execution evidence for tool, skill, or agent candidates before your code decides what to delegate.

See the precheck() API guide for boundaries, policy options, result shape, and errors.

MCP Server

Use XAIP directly from Claude, Cursor, or any MCP-compatible AI agent:

npx xaip-mcp-trust

4 tools: xaip_list_servers, xaip_check_trust, xaip_select, xaip_report

Add to Claude Code (~/.claude/claude_desktop_config.json):

{
  "mcpServers": {
    "xaip-trust": {
      "command": "npx",
      "args": ["-y", "xaip-mcp-trust"]
    }
  }
}

npm: xaip-mcp-trust

API Reference

MethodEndpointDescription
GET/v1/serversList all scored servers with trust data
GET/v1/trust/:slugTrust score for a single scored server
GET/v1/trust?slugs=a,b,cBatch trust scores (max 50)
POST/v1/selectDecision engine — rank candidates by available execution evidence
GET/healthLiveness probe

Base URL: https://xaip-trust-api.kuma-github.workers.dev

Trust Score Response

FieldTypeDescription
trustnumber | null0.0–1.0 score, null if unscored
verdictstringtrusted ≥0.7 · caution 0.4–0.7 · low_trust <0.4 · unscored
receiptsnumberTotal verified execution receipts
confidencenumber | nullStatistical confidence: min(1, receipts/100)
riskFlagsstring[]Detected risk indicators
computedFromstringData provenance description

Decision Engine (POST /v1/select)

Request:

{
  "task": "description of what your agent needs to do",
  "candidates": ["server-a", "server-b", "server-c"],
  "mode": "relative"
}
  • mode: "relative" (default) — always selects the best available, even if below threshold
  • mode: "strict" — rejects all candidates below caution threshold

Architecture

┌──────────────────────────────────────────────────────────┐
│  Your AI Agent                                           │
│  ┌──────────┐   ┌───────────┐   ┌─────────────────────┐ │
│  │ Select   │──▶│ Execute   │──▶│ Report Receipt      │ │
│  │ (Trust   │   │ (MCP call)│   │ (Ed25519 signed)    │ │
│  │  API)    │   └───────────┘   └──────────┬──────────┘ │
│  └────┬─────┘                              │            │
└───────┼────────────────────────────────────┼────────────┘
        │                                    │
        ▼                                    ▼
┌───────────────┐                 ┌──────────────────────┐
│  Trust API    │◀────────────────│  Aggregator (BFT)    │
│  + Decision   │  Service        │  Cloudflare D1       │
│    Engine     │  Binding        │  Ed25519 verification│
└───────────────┘                 │  Bayesian scoring    │
                                  └──────────────────────┘

Trust Model:

  • Bayesian Beta distribution (prior varies by DID method)
  • Caller diversity weighting (prevents single-caller gaming)
  • Co-signature factor (dual Ed25519: agent + caller)
  • BFT-capable federation with MAD outlier detection across aggregator nodes; the current public deployment is a single aggregator (quorum:1)

Infrastructure:

  • Cloudflare Workers (global edge, <50ms latency)
  • Cloudflare D1 (SQLite at edge) for receipt storage
  • Service Bindings for Worker-to-Worker communication

Optional Ledger-Backed Identity

XAIP is not a blockchain protocol and not a payment rail.

The current identity model supports multiple DID methods, including did:key, did:web, and ledger-backed identifiers such as did:xrpl. Trust scores are derived from signed execution receipts — not token holdings, payments, or chain affiliation.

The default priors below are deployment policy, not a universal claim about trust:

DID MethodDefault PriorUse Case
did:xrpl[5, 1]Ledger-backed agents (one externally anchored option)
did:web[2, 1]Domain-verified servers
did:key[1, 1]Anonymous / new agents

Ledger-backed identities may be useful when an agent needs an externally anchored identity. XAIP itself does not require any ledger.

Data

Trust scores are computed from real execution data, not synthetic benchmarks:

  • ~4,500 signed receipts across 10 scored MCP servers as of 2026-06-12: context7, sequential-thinking, memory, filesystem, everything, fetch, sqlite, git, puppeteer, playwright
  • Automated daily data collection via GitHub Actions
  • Scores update with every new execution receipt; see the live dashboard/API for current values
# See all scored servers
curl https://xaip-trust-api.kuma-github.workers.dev/v1/servers

Works With

Runtime / integrationStatusHow
MCP (Model Context Protocol)Live public dataset (10 servers, ~4,500 signed receipts as of 2026-06-12)xaip-claude-hook, xaip-sdk, xaip-mcp-trust
LangChain.jsTested preview + live receipts landedxaip-langchain callback handler
OpenAI-compatible tool-call loopsTested preview + live receipts landedxaip-openai wrapper
HTTP tools / A2A / proprietary runtimesSupported receipt flowxaip-sdk or direct signed receipt emission

The receipt schema is intentionally tool-system-agnostic: agentDid, callerDid, taskHash, resultHash, success, latencyMs, failureType, timestamp. Any agent framework that can hash inputs/outputs and sign with Ed25519 can contribute receipts.

See Emit XAIP receipts from anything for the provider-neutral receipt flow.

Status

v0.4.0 live; v0.5 draft in development (tool class taxonomy + observation/display plumbing).

  • v0.5 class metadata plumbing is live for observation/display.

  • Class-aware scoring remains a design note and is not used by current trust scores or /v1/select selection behavior.

  • Trust Score API (Cloudflare Worker, live)

  • Decision Engine (POST /v1/select)

  • Aggregator with BFT-capable federation support (public deployment currently single aggregator / quorum:1)

  • Ed25519 receipt signing + verification

  • Bayesian trust model with caller diversity

  • ~4,500 signed receipts across 10 scored MCP servers as of 2026-06-12

  • Automated daily data collection (GitHub Actions)

  • Published preview receipt producers: xaip-langchain, xaip-openai

  • MCP Server: xaip-mcp-trust

  • npm: [email protected]

  • v0.5 specification draft (tool class taxonomy; class-aware scoring is a design note only)

  • Multi-caller diversity mechanism verified (2+ caller identities, metric responds across 8 servers)

  • v0.5 class metadata plumbing (observation/display only — does not affect scoring or /v1/select)

  • Class-aware scoring (design note only — not live behavior)

  • Zero-install caller path: npx xaip-caller (30-second first contribution, demonstrates XAIP beyond MCP)

  • External operator callers (mechanism live, external adoption pending — run npx xaip-caller or the full guide)

Internet-Draft

The XAIP receipt wire format is published as an individual Internet-Draft on IETF Datatracker:

This is an individual Internet-Draft. It is not an IETF standard, not IETF-approved, and has no formal standing in the IETF standards process. The draft scope is the receipt wire format only — scoring, aggregation, and decision logic are deployment policy and out of scope of the draft itself.

Cite as (work in progress — cite the specific revision for reproducibility):

xkumakichi, "Signed Execution Receipts for AI Agent Tool Calls (XAIP Receipts)", Work in Progress, Internet-Draft, draft-xkumakichi-xaip-receipts-01, 19 June 2026, https://datatracker.ietf.org/doc/draft-xkumakichi-xaip-receipts/.

BibTeX and BibXML exports are available from the Datatracker page linked above.

Writing

  • Portable Trust — why trust infrastructure for AI agents must be provider-neutral and behavior-derived (dev.to · Zenn 日本語版)
  • Evidence Before Payment — the agent-payment stack describes the transaction at hand; portable evidence of a counterparty's prior execution stays thin. Defines that design problem, independent of any one implementation (article)

Related

  • xaip-caller — zero-install CLI: npx xaip-caller to contribute to the trust graph
  • xaip-mcp-trust — MCP server for AI agents to check trust scores
  • xaip-langchain — LangChain.js callback handler that emits XAIP receipts
  • xaip-openai — OpenAI tool-calling wrapper with signed receipts
  • Veridict — earlier runtime execution logging experiment that informed XAIP's receipt-first design. Previously published as an npm package; not maintained as an independent product.
  • XAIP Specification v0.4 — Current protocol specification
  • XAIP Specification v0.5 RC — Release candidate (tool class taxonomy)

License

MIT