Agent Evals by Galileo

Bring agent evaluations, observability, and synthetic test set generation directly into your IDE for free with Galileo's new MCP server

ℹ️ These docs are for the v2.0 version of Galileo. Documentation for v1.0 version can be found here.

light logo

Galileo MCP Server

The Galileo Model Context Protocol (MCP) server enables seamless integration between AI-powered IDEs, such as Cursor, or VS Code with GitHub Copilot, and Galileo’s evaluation and observability platform. With MCP, you can access Galileo’s capabilities directly from your development environment, including:

  • Creating and managing datasets
  • Running experiments
  • Setting up prompt templates
  • Getting insights on Log streams
  • Integrating Galileo with your code

Prerequisites

Before you begin, ensure you have the following:

1

AI-enabled IDE

Install an AI-enabled IDE such as Cursor or VS Code with AI capabilities

2

API key

Generate your Galileo API key from the API keys page

Configure your IDE

The Galileo MCP server works with both Cursor and VS Code. Follow the steps below for your IDE:

  • VS Code
  • Cursor

1

Install GitHub Copilot

Install the GitHub Copilot extension if you haven’t already

2

Open MCP settings

Open the Command Palette (Ctrl + Shift + P on Windows/Linux, or Cmd + Shift + P on Mac) and search for “MCP: Open User Configuration”

3

Add the Galileo MCP server configuration

Copy and paste the configuration below. Replace YOUR-API-KEY with your actual Galileo API key.

VSCode MCP Configuration

{
  "servers": {
    "galileo_mcp_server": {
      "url": "https://api.galileo.ai/mcp/http/mcp",
      "headers": {
        "Galileo-API-Key": "YOUR-API-KEY",
        "Accept": "text/event-stream"
      }
    }
  },
  "inputs": []
}

4

Reload VS Code

Reload VS Code by opening the Command Palette and running “Developer: Reload Window” for the changes to take effect

The configuration is the same for both Cursor and VS Code. Make sure to replace YOUR-API-KEY with your actual Galileo API key from the API keys page.

Verify your setup

Once configured, you can verify your MCP setup by asking your AI assistant in your IDE:

Example Query

Create Dataset Query

Integration Query

Can you show me how to add Galileo logging to my agent bot?

Your AI assistant should now be able to access Galileo’s capabilities and respond with information from your Galileo account.

Tools

The Galileo MCP server provides powerful tools that you can access through natural conversation with your AI assistant. Simply ask questions or make requests, and the AI will use these tools to help you.

Create Datasets

Generate synthetic datasets or upload your own data to test and evaluate your AI applications. The tool supports creating datasets with various types of queries including general queries, prompt injections, off-topic content, and toxic content scenarios.What you can ask:

Query

Create a dataset with 50 customer service queries about billing issues

Query

Generate a dataset of 30 chatbot queries, including some prompt injection attempts

Query

Make a dataset with product recommendation queries and include off-topic questions

Check Dataset Status

Track the progress of your dataset generation and preview the generated content. You’ll see the first 10 rows of data along with generation status and progress updates.What you can ask:

Query

Check the status of my dataset that's currently generating

Query

Show me the preview of dataset abc-123

Query

Is my customer service dataset ready yet?

Create Prompt Templates

Build reusable prompt templates that you can use across all your projects. Set up model configurations, temperature settings, and other parameters for consistent prompt behavior.What you can ask:

Query

Create a prompt template called "Friendly Assistant" for customer support responses

Query

Make a prompt template for summarizing technical documentation with lower temperature

Query

Set up a chat template for a code review assistant

Set Up Experiments

Get complete guidance on setting up and running Galileo experiments, including dataset preparation, metrics configuration, and integration with your existing code. Available for Python and other supported languages.What you can ask:

Query

How do I set up a Galileo experiment in Python?

Query

Show me how to run an experiment with my RAG application

Query

Guide me through creating an experiment for my agentic workflow

Get Log Stream Insights

Analyze your application’s Log streams to identify issues, patterns, and opportunities for improvement. Get specific recommendations based on your logged data.What you can ask:

Query

What issues do you see in my production Log stream?

Query

Analyze the customer-support project Log stream and suggest improvements

Query

Get insights about my chatbot Log stream from last week

Integrate with OpenAI

Get step-by-step integration guides for adding Galileo observability to your OpenAI applications. Automatically log prompts, responses, model parameters, and token usage with minimal code changes.What you can ask:

Query

How do I add Galileo logging to my OpenAI application?

Query

Show me how to integrate Galileo with OpenAI in TypeScript

Query

Help me set up Galileo observability for my GPT-4 chatbot

Integrate with LangChain

Get complete integration instructions for adding Galileo to your LangChain applications. Capture full traces of your chains, agents, and tools with automatic logging.What you can ask:

Query

How do I integrate Galileo with my LangChain application?

Query

Show me how to add Galileo tracing to my LangChain agent

Query

Help me log my LangChain RAG pipeline with Galileo

Search Documentation

Find relevant information, code examples, API references, and implementation guides across all Galileo documentation. Get direct links to the pages you need.What you can ask:

Query

How do I set up data logging in Galileo?

Query

Find documentation about custom metrics

Query

Search for examples of agentic AI evaluation

Example use cases

Create a synthetic dataset

Ask your AI assistant:

Query

Create a synthetic dataset with 20 customer service queries about product
returns. Include both general queries and off-topic queries.

The MCP server will guide you through the dataset creation process and provide a dataset ID to track progress.

Get integration help

Query

How do I integrate Galileo with my LangChain application in Python?

The MCP server will provide complete integration code examples and setup instructions.

Query

Get insights about my Log stream in the customer-support project

The MCP server will analyze your Log stream and suggest improvements.

Troubleshooting

Error connecting to Galileo MCP Server

  • Check that the MCP server URL is set to https://api.galileo.ai/mcp/http/mcp
  • Ensure the Accept header is set to text/event-stream
  • Restart your IDE MCP connection after making configuration changes

API key errors

  • Confirm your API key is properly set in the configuration
  • Check that your API key has not expired
  • Generate a new API key from the API keys page

Next steps

Log your first trace

Learn how to log your first trace with Galileo

Run your first experiment

Set up and run experiments to evaluate your AI applications

Explore integrations

Learn about Galileo integrations with third-party frameworks

Was this page helpful?

YesNo

Suggest editsRaise issue

Run an Experiment

Overview

Related Servers