Agent Evals by Galileo
Bring agent evaluations, observability, and synthetic test set generation directly into your IDE for free with Galileo's new MCP server
ℹ️ These docs are for the v2.0 version of Galileo. Documentation for v1.0 version can be found here.
Galileo MCP Server
The Galileo Model Context Protocol (MCP) server enables seamless integration between AI-powered IDEs, such as Cursor, or VS Code with GitHub Copilot, and Galileo’s evaluation and observability platform. With MCP, you can access Galileo’s capabilities directly from your development environment, including:
- Creating and managing datasets
- Running experiments
- Setting up prompt templates
- Getting insights on Log streams
- Integrating Galileo with your code
Prerequisites
Before you begin, ensure you have the following:
1
AI-enabled IDE
Install an AI-enabled IDE such as Cursor or VS Code with AI capabilities
2
API key
Generate your Galileo API key from the API keys page
Configure your IDE
The Galileo MCP server works with both Cursor and VS Code. Follow the steps below for your IDE:
- VS Code
- Cursor
1
Install GitHub Copilot
Install the GitHub Copilot extension if you haven’t already
2
Open MCP settings
Open the Command Palette (Ctrl + Shift + P on Windows/Linux, or Cmd + Shift + P on Mac) and search for “MCP: Open User Configuration”
3
Add the Galileo MCP server configuration
Copy and paste the configuration below. Replace YOUR-API-KEY with your actual Galileo API key.
VSCode MCP Configuration
{
"servers": {
"galileo_mcp_server": {
"url": "https://api.galileo.ai/mcp/http/mcp",
"headers": {
"Galileo-API-Key": "YOUR-API-KEY",
"Accept": "text/event-stream"
}
}
},
"inputs": []
}
4
Reload VS Code
Reload VS Code by opening the Command Palette and running “Developer: Reload Window” for the changes to take effect
The configuration is the same for both Cursor and VS Code. Make sure to replace YOUR-API-KEY with your actual Galileo API key from the API keys page.
Verify your setup
Once configured, you can verify your MCP setup by asking your AI assistant in your IDE:
Example Query
Create Dataset Query
Integration Query
Can you show me how to add Galileo logging to my agent bot?
Your AI assistant should now be able to access Galileo’s capabilities and respond with information from your Galileo account.
Tools
The Galileo MCP server provides powerful tools that you can access through natural conversation with your AI assistant. Simply ask questions or make requests, and the AI will use these tools to help you.
Create Datasets
Generate synthetic datasets or upload your own data to test and evaluate your AI applications. The tool supports creating datasets with various types of queries including general queries, prompt injections, off-topic content, and toxic content scenarios.What you can ask:
Query
Create a dataset with 50 customer service queries about billing issues
Query
Generate a dataset of 30 chatbot queries, including some prompt injection attempts
Query
Make a dataset with product recommendation queries and include off-topic questions
Check Dataset Status
Track the progress of your dataset generation and preview the generated content. You’ll see the first 10 rows of data along with generation status and progress updates.What you can ask:
Query
Check the status of my dataset that's currently generating
Query
Show me the preview of dataset abc-123
Query
Is my customer service dataset ready yet?
Create Prompt Templates
Build reusable prompt templates that you can use across all your projects. Set up model configurations, temperature settings, and other parameters for consistent prompt behavior.What you can ask:
Query
Create a prompt template called "Friendly Assistant" for customer support responses
Query
Make a prompt template for summarizing technical documentation with lower temperature
Query
Set up a chat template for a code review assistant
Set Up Experiments
Get complete guidance on setting up and running Galileo experiments, including dataset preparation, metrics configuration, and integration with your existing code. Available for Python and other supported languages.What you can ask:
Query
How do I set up a Galileo experiment in Python?
Query
Show me how to run an experiment with my RAG application
Query
Guide me through creating an experiment for my agentic workflow
Get Log Stream Insights
Analyze your application’s Log streams to identify issues, patterns, and opportunities for improvement. Get specific recommendations based on your logged data.What you can ask:
Query
What issues do you see in my production Log stream?
Query
Analyze the customer-support project Log stream and suggest improvements
Query
Get insights about my chatbot Log stream from last week
Integrate with OpenAI
Get step-by-step integration guides for adding Galileo observability to your OpenAI applications. Automatically log prompts, responses, model parameters, and token usage with minimal code changes.What you can ask:
Query
How do I add Galileo logging to my OpenAI application?
Query
Show me how to integrate Galileo with OpenAI in TypeScript
Query
Help me set up Galileo observability for my GPT-4 chatbot
Integrate with LangChain
Get complete integration instructions for adding Galileo to your LangChain applications. Capture full traces of your chains, agents, and tools with automatic logging.What you can ask:
Query
How do I integrate Galileo with my LangChain application?
Query
Show me how to add Galileo tracing to my LangChain agent
Query
Help me log my LangChain RAG pipeline with Galileo
Search Documentation
Find relevant information, code examples, API references, and implementation guides across all Galileo documentation. Get direct links to the pages you need.What you can ask:
Query
How do I set up data logging in Galileo?
Query
Find documentation about custom metrics
Query
Search for examples of agentic AI evaluation
Example use cases
Create a synthetic dataset
Ask your AI assistant:
Query
Create a synthetic dataset with 20 customer service queries about product
returns. Include both general queries and off-topic queries.
The MCP server will guide you through the dataset creation process and provide a dataset ID to track progress.
Get integration help
Query
How do I integrate Galileo with my LangChain application in Python?
The MCP server will provide complete integration code examples and setup instructions.
Query
Get insights about my Log stream in the customer-support project
The MCP server will analyze your Log stream and suggest improvements.
Troubleshooting
Error connecting to Galileo MCP Server
- Check that the MCP server URL is set to
https://api.galileo.ai/mcp/http/mcp - Ensure the
Acceptheader is set totext/event-stream - Restart your IDE MCP connection after making configuration changes
API key errors
- Confirm your API key is properly set in the configuration
- Check that your API key has not expired
- Generate a new API key from the API keys page
Next steps
Log your first trace
Learn how to log your first trace with Galileo
Run your first experiment
Set up and run experiments to evaluate your AI applications
Explore integrations
Learn about Galileo integrations with third-party frameworks
Was this page helpful?
YesNo
Suggest editsRaise issue
Run an Experiment
Overview
Related Servers
VeyraX
Single tool to control all 100+ API integrations, and UI components
Knowledge Graph Memory Server
Enables persistent memory for Claude using a local knowledge graph of entities, relations, and observations.
MCP LaTeX Server
Create, edit, and manage LaTeX files. Requires an external LaTeX distribution like MiKTeX, TeX Live, or MacTeX.
Berry MCP Server
A universal framework for easily creating and deploying Model Context Protocol servers with any tools.
MCP-Insomnia
An MCP server for AI agents to create and manage API collections in Insomnia-compatible format.
Sequential Thinking Multi-Agent System (MAS)
An MCP agent that utilizes a Multi-Agent System (MAS) for sequential thinking and problem-solving.
AWS CDK MCP Server
Offers guidance and tools for AWS CDK, covering best practices, security compliance with CDK Nag, infrastructure validation, and pattern discovery.
MCP Developer Name
Returns the current developer's information.
Coding Assistant Server
A coding assistant server that provides context-aware code suggestions, documentation integration, and technology detection.
OPNsense MCP Server
A comprehensive MCP server for managing OPNsense firewalls, offering over 300 tools for configuration and monitoring.