ArXiv-MCP
Search and retrieve academic papers from arXiv based on keywords.
A Model Context Protocol (MCP) server implementation that enables AI agents to search, retrieve, and analyze academic papers from arXiv, the popular open-access repository of electronic preprints.
Overview
This project provides an MCP server that allows AI agents to interact with the arXiv repository, search for papers based on keywords, retrieve detailed information about specific papers, and even extract and analyze the content of papers. It serves as both a practical tool for research assistance and a reference implementation for building MCP servers.
The implementation follows the best practices laid out for building MCP servers, allowing seamless integration with any MCP-compatible client.
Features
The server provides several powerful tools for academic research:
search_papers
: Search for papers on arXiv using keywords and get comprehensive summariesget_paper_details
: Retrieve detailed information about a specific paper by its arXiv IDextract_paper_content
: Download and extract the full text content from a paper's PDFanalyze_paper
: Analyze a paper's content and generate a comprehensive summary
Prerequisites
- Python 3.11+
- Docker if running the MCP server as a container (recommended)
Installation
Using uv
-
Install uv if you don't have it:
pip install uv
-
Clone this repository:
git clone https://github.com/kelvingao/arxiv-mcp.git cd arxiv-mcp
-
Install dependencies:
uv pip install -e .
-
Create a
.env
file based on.env.example
:cp .env.example .env
-
Configure your environment variables in the
.env
file (see Configuration section)
Using Docker (Recommended)
-
Build the Docker image:
docker build -t mcp/arxiv --build-arg PORT=8050 .
-
Create a
.env
file based on.env.example
and configure your environment variables
Configuration
The following environment variables can be configured in your .env
file:
Variable | Description | Example |
---|---|---|
TRANSPORT | Transport protocol (sse or stdio) | sse |
HOST | Host to bind to when using SSE transport | 0.0.0.0 |
PORT | Port to listen on when using SSE transport | 8050 |
Running the Server
Using uv
SSE Transport
# Set TRANSPORT=sse in .env then:
python src/server.py
The MCP server will run as an API endpoint that you can connect to with the configuration shown below.
Stdio Transport
With stdio, the MCP client itself can spin up the MCP server, so nothing to run at this point.
Using Docker
SSE Transport
docker run --env-file .env -p 8050:8050 mcp/arxiv
The MCP server will run as an API endpoint within the container that you can connect to with the configuration shown below.
Stdio Transport
With stdio, the MCP client itself can spin up the MCP server container, so nothing to run at this point.
Integration with MCP Clients
SSE Configuration
Once you have the server running with SSE transport, you can connect to it using this configuration:
{
"mcpServers": {
"arxiv": {
"transport": "sse",
"url": "http://localhost:8050/sse"
}
}
}
Note for Windsurf users: Use
serverUrl
instead ofurl
in your configuration:{ "mcpServers": { "arxiv": { "transport": "sse", "serverUrl": "http://localhost:8050/sse" } } }
Note for n8n users: Use host.docker.internal instead of localhost since n8n has to reach outside of its own container to the host machine:
So the full URL in the MCP node would be: http://host.docker.internal:8050/sse
Make sure to update the port if you are using a value other than the default 8050.
Python with Stdio Configuration
Add this server to your MCP configuration for Claude Desktop, Windsurf, or any other MCP client:
{
"mcpServers": {
"arxiv": {
"command": "your/path/to/arxiv-mcp/.venv/bin/python",
"args": ["your/path/to/arxiv-mcp/src/main.py"],
"env": {
"TRANSPORT": "stdio"
}
}
}
}
Docker with Stdio Configuration
{
"mcpServers": {
"arxiv": {
"command": "docker",
"args": ["run", "--rm", "-i",
"-e", "TRANSPORT",
"mcp/arxiv"],
"env": {
"TRANSPORT": "stdio"
}
}
}
}
Usage Examples
Here are some examples of how to use the arXiv MCP server with an AI agent:
Searching for Papers
Find recent papers about quantum computing published in the last year.
Getting Paper Details
Get details for the paper with arXiv ID 2303.08774
Extracting Paper Content
Extract the full text from the paper with arXiv ID 2303.08774
Analyzing a Paper
Analyze the methodology section of the paper with arXiv ID 2303.08774
Building Your Own MCP Server
This implementation provides a foundation for building more complex MCP servers. To build your own:
- Add your own tools by creating methods with the
@mcp.tool()
decorator - Create your own lifespan function to add your own dependencies (clients, database connections, etc.)
- Modify the existing tools or add new ones to enhance functionality
- Add prompts and resources with
@mcp.resource()
and@mcp.prompt()
License
Acknowledgements
- arXiv for providing open access to research papers
- The Model Context Protocol team for creating the MCP standard
Related Servers
YouTube Music MCP
Search and play tracks on YouTube Music via AI assistants.
Entscheidsuche
Search Swiss legal decisions using the entscheidsuche.ch API.
Wikipedia
Retrieves information from Wikipedia to provide context to Large Language Models (LLMs).
DigiKey
Search for electronic components and products using the DigiKey Product Search API.
Google News
Google News search capabilities with automatic topic categorization and multi-language support via SerpAPI integration.
Plex MCP Server
Search your Plex media library. Supports OAuth and static token authentication.
Haloscan MCP Server
An MCP server for interacting with the Haloscan SEO API.
People Data Labs
Access person, company, school, location, job title, and skill data using the People Data Labs API.
Semiconductor Supply Chain MCP Server
Access semiconductor B2B platforms like AnySilicon and DesignReuse for IP core and ASIC service procurement.
Amadeus MCP Server
Search for flight offers using the Amadeus Flight Offers Search API.