Anime MCP Server
An AI-powered anime search and recommendation system built with FastAPI, Qdrant vector database, FastMCP protocol, and LangGraph workflow orchestration. Features semantic search capabilities over 38,000+ anime entries with MCP integration for AI assistants.
Features
- Semantic Search: Natural language queries for anime discovery
- High Performance: Sub-200ms response times with vector embeddings
- Comprehensive Database: 38,894 anime entries with rich metadata
- MCP Protocol Integration: FastMCP server for AI assistant communication
- Real-time Vector Search: Qdrant-powered semantic search
- Modern Multi-Modal Search: Advanced image search with SigLIP/JinaCLIP v2 (512x512 resolution) and BGE-M3 text embeddings
- Legacy Multi-Modal Search: Visual similarity and combined text+image search with CLIP embeddings
- Conversational Workflows: LangGraph-powered intelligent conversation flows with ToolNode integration
- Smart Orchestration: Advanced multi-step query processing with complexity assessment
- AI-Powered Query Understanding: Natural language parameter extraction with LLM intelligence
- Intelligent Parameter Extraction: Automatic limit, genre, year, and exclusion detection
- Native LangGraph Integration: ToolNode-based workflow engine with ~200 fewer lines of boilerplate
- Docker Support: Easy deployment with containerized services
Architecture
anime-mcp-server/
├── src/
│   ├── main.py                          # FastAPI application entry point
│   ├── config.py                        # Centralized configuration management
│   ├── api/
│   │   ├── search.py                    # Search endpoints
│   │   ├── admin.py                     # Admin endpoints
│   │   ├── recommendations.py           # Recommendation endpoints (basic)
│   │   └── workflow.py                  # LangGraph workflow endpoints
│   ├── anime_mcp/
│   │   ├── modern_server.py             # Modern MCP server with LangGraph workflows
│   │   ├── server.py                    # Core MCP server implementation
│   │   ├── handlers/                    # MCP request handlers
│   │   └── tools/                       # Platform-specific MCP tools
│   ├── langgraph/
│   │   ├── langchain_tools.py           # LangChain tool creation & ToolNode workflow
│   │   ├── react_agent_workflow.py      # ReactAgent workflow engine
│   │   ├── anime_swarm.py               # Multi-agent swarm workflows
│   │   └── agents/                      # Specialized workflow agents
│   ├── vector/
│   │   ├── qdrant_client.py             # Vector database operations with modern embedding support
│   │   ├── modern_text_processor.py     # Modern text embeddings (BGE-M3, HuggingFace, Sentence Transformers)
│   │   ├── modern_vision_processor.py   # Modern vision embeddings (SigLIP, JinaCLIP v2, CLIP)
│   │   └── vision_processor.py          # Legacy CLIP image processing
│   ├── models/
│   │   └── anime.py                     # Pydantic data models
│   ├── services/
│   │   ├── data_service.py              # Data processing pipeline
│   │   ├── smart_scheduler.py           # Rate limiting coordination
│   │   ├── update_service.py            # Database update management
│   │   └── external/                    # External platform services
│   └── exceptions.py                    # Custom exception classes
├── scripts/
│   ├── test_mcp.py                      # MCP server testing client
│   ├── migrate_to_multivector.py        # Collection migration script
│   ├── add_image_embeddings.py          # Image processing pipeline
│   └── test_mcp_server_comprehensive.py # MCP functionality verification
├── data/
│   ├── raw/                             # Original anime database JSON
│   └── qdrant_storage/                  # Qdrant vector database files
├── docker-compose.yml                   # Service orchestration
└── requirements.txt                     # Python dependencies
Quick Start
Prerequisites
- Python 3.11+
- Docker & Docker Compose
- 4GB+ RAM (for vector processing)
1. Clone and Setup
git clone <your-repo-url>
cd anime-mcp-server
# Create virtual environment
python -m venv venv
source venv/bin/activate # Linux/Mac
# OR
venv\Scripts\activate # Windows
# Install dependencies
pip install -r requirements.txt
2. Start Services
Full Stack (Recommended)
# Start complete stack with Docker
docker compose up -d
# Services will be available at:
# - FastAPI REST API: http://localhost:8000
# - MCP Server (HTTP): http://localhost:8001 (if mcp-server container is enabled)
# - Qdrant Vector DB: http://localhost:6333
Deployment Options:
- REST API Only (Default)
  # Start just FastAPI + Qdrant
  docker compose up -d fastapi qdrant
- REST API + MCP HTTP Server
  # Start all services (REST API on :8000, MCP HTTP on :8001)
  docker compose up -d
- MCP stdio mode (for Claude Code integration)
  # Start infrastructure only
  docker compose up -d qdrant
  # Run MCP server locally in stdio mode
  python -m src.anime_mcp.modern_server
Local Development
# Start Qdrant vector database
docker compose up -d qdrant
# Create environment file
cat > .env << EOF
QDRANT_URL=http://localhost:6333
QDRANT_COLLECTION_NAME=anime_database
HOST=0.0.0.0
PORT=8000
DEBUG=True
ENABLE_MULTI_VECTOR=true
EOF
# Start FastAPI server
uvicorn src.main:app --host 0.0.0.0 --port 8000 --reload
3. Initialize Database (First Time Setup)
Production Indexing (Required for first run):
# Start the full-update process (runs in background)
curl -X POST http://localhost:8000/api/admin/update-full
# Monitor indexing progress (check logs)
docker compose logs fastapi --tail 50 -f
# Expected output shows batch progress:
# INFO - Uploaded batch 29/389 (100 points)
# ...continues until batch 389/389
Indexing Progress: The system will process all 38,894 anime entries with dual image vectors. This typically takes 2-3 hours depending on network speed for image downloads.
4. Verify System Status
# Check system health
curl http://localhost:8000/health
# Response: {"status":"healthy","qdrant":"connected","timestamp":"..."}
# Check database stats (after indexing completes)
curl http://localhost:8000/stats
# Response: {"total_documents":38894,"vector_size":384,"status":"green"}
# Check indexing status
curl http://localhost:8000/api/admin/update-status
# Response: {"entry_count":38894,"last_full_update":"2025-06-29T19:00:00Z"}
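If you prefer to monitor indexing programmatically rather than tailing logs, a small polling loop against the update-status endpoint works. A minimal sketch using httpx (already part of the test stack); the target count and polling interval are assumptions you may want to tune:
# poll_index_status.py - watches the entry_count field shown in the example response above
import time
import httpx

TARGET_ENTRIES = 38_894

while True:
    status = httpx.get("http://localhost:8000/api/admin/update-status", timeout=10.0).json()
    count = status.get("entry_count", 0)
    print(f"indexed entries: {count}")
    if count >= TARGET_ENTRIES:
        print("indexing complete:", status.get("last_full_update"))
        break
    time.sleep(60)  # indexing takes hours; poll once a minute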
MCP Server Integration
Transport Protocols
The MCP servers support multiple transport protocols for different use cases:
Core Server (src.anime_mcp.server):
Protocol | Use Case | Port | Command |
---|---|---|---|
stdio | Local development, Claude Code | N/A | python -m src.anime_mcp.server |
http | HTTP clients | 8001 | python -m src.anime_mcp.server --mode http |
sse | Server-Sent Events, web clients | 8001 | python -m src.anime_mcp.server --mode sse |
streamable | Streamable HTTP transport | 8001 | python -m src.anime_mcp.server --mode streamable |
Modern Server (src.anime_mcp.modern_server):
Protocol | Use Case | Port | Command |
---|---|---|---|
stdio | Local development, Claude Code | N/A | python -m src.anime_mcp.modern_server |
sse | Server-Sent Events, web clients | 8001 | python -m src.anime_mcp.modern_server --mode sse |
Running the MCP Server
Core Server (All transport protocols):
# Local development (stdio) - default mode
python -m src.anime_mcp.server
# HTTP transport
python -m src.anime_mcp.server --mode http --host 0.0.0.0 --port 8001
# Server-Sent Events (SSE)
python -m src.anime_mcp.server --mode sse --host 0.0.0.0 --port 8001
# Streamable HTTP transport
python -m src.anime_mcp.server --mode streamable --host 0.0.0.0 --port 8001
# With verbose logging
python -m src.anime_mcp.server --mode sse --verbose
Modern Server (LangGraph workflows):
# Local development (stdio) - default mode
python -m src.anime_mcp.modern_server
# Server-Sent Events (SSE)
python -m src.anime_mcp.modern_server --mode sse --host 0.0.0.0 --port 8001
# With verbose logging
python -m src.anime_mcp.modern_server --mode sse --verbose
Testing:
# Test MCP functionality
python scripts/test_mcp_server_comprehensive.py
Configuration Options
MCP Server Configuration
Environment variables for MCP server:
# MCP Server Configuration
SERVER_MODE=stdio # Core server: stdio, http, sse, streamable | Modern server: stdio, sse
MCP_HOST=0.0.0.0 # HTTP server host (for HTTP modes)
MCP_PORT=8001 # HTTP server port (for HTTP modes)
Modern Embedding Models Configuration
The system now supports modern embedding models for improved accuracy and performance:
# Text Embedding Configuration
TEXT_EMBEDDING_PROVIDER=fastembed # Options: fastembed, huggingface, sentence-transformers
TEXT_EMBEDDING_MODEL=BAAI/bge-small-en-v1.5 # Model name for text embeddings
TEXT_EMBEDDING_MODEL_FALLBACK=BAAI/bge-base-en-v1.5 # Fallback model
# Image Embedding Configuration
IMAGE_EMBEDDING_PROVIDER=clip # Options: clip, siglip, jinaclip
IMAGE_EMBEDDING_MODEL=ViT-B/32 # Model name for image embeddings
IMAGE_EMBEDDING_MODEL_FALLBACK=ViT-L/14 # Fallback model
# SigLIP Configuration (improved zero-shot performance)
SIGLIP_MODEL=google/siglip-so400m-patch14-384
SIGLIP_INPUT_RESOLUTION=384 # Higher resolution than CLIP's 224
# JinaCLIP v2 Configuration (512x512 resolution, multilingual)
JINACLIP_MODEL=jinaai/jina-clip-v2
JINACLIP_INPUT_RESOLUTION=512 # 4x higher resolution than CLIP
JINACLIP_TEXT_MAX_LENGTH=77
# BGE Configuration (latest text embeddings)
BGE_MODEL_VERSION=v1.5 # Options: v1.5, m3, reranker
BGE_MODEL_SIZE=small # Options: small, base, large
BGE_ENABLE_MULTILINGUAL=false # Enable BGE-M3 multilingual model
BGE_MAX_LENGTH=512
# Model Management
ENABLE_MODEL_FALLBACK=true # Automatic fallback on model failure
MODEL_CACHE_DIR=/path/to/cache # Custom model cache directory
MODEL_WARM_UP=true # Pre-load models during startup
ENABLE_LEGACY_MODEL_SUPPORT=true # Maintain compatibility with existing vectors
Modern vs Legacy Models:
Feature | Legacy Models | Modern Models |
---|---|---|
Text Embedding | BGE-small-en-v1.5 (384d) | BGE-M3 (1024d, 100+ languages) |
Image Embedding | CLIP ViT-B/32 (224px, 512d) | JinaCLIP v2 (512px, 768d) |
Performance | 3.5s avg response | <0.5s target with SigLIP |
Accuracy | 57% image search | 25%+ improvement expected |
Languages | English primarily | 89 languages (JinaCLIP v2) |
Resolution | 224x224 pixels | 512x512 pixels (4x detail) |
Performance Benchmarking:
# Run embedding model benchmark
python scripts/benchmark_modern_embeddings.py
# Expected improvements:
# - SigLIP: 40% better zero-shot performance
# - JinaCLIP v2: 4x higher resolution, 98% Flickr30k accuracy
# - BGE-M3: Multilingual support, 8192 token context
Client Integration
Claude Code (stdio mode)
{
"mcpServers": {
"anime-search": {
"command": "python",
"args": ["-m", "src.anime_mcp.modern_server"],
"cwd": "/path/to/anime-mcp-server"
}
}
}
Web Clients (HTTP/SSE modes)
Core Server:
- HTTP mode: python -m src.anime_mcp.server --mode http --port 8001
- SSE mode: python -m src.anime_mcp.server --mode sse --port 8001
- Streamable mode: python -m src.anime_mcp.server --mode streamable --port 8001
Modern Server:
- SSE mode: python -m src.anime_mcp.modern_server --mode sse --port 8001
Endpoints:
- MCP endpoint: http://localhost:8001/ (varies by transport)
- Health check: curl http://localhost:8001/health (when available)
MCP Tools Available
Core Search Tools (Core Server; a minimal client sketch follows the table):
Tool | Description | Parameters |
---|---|---|
search_anime | Semantic search with advanced filtering | query , limit , genres , year_range , exclusions |
get_anime_details | Get detailed anime information by ID | anime_id (string) |
find_similar_anime | Find similar anime by vector similarity | anime_id , limit (optional) |
get_anime_stats | Database statistics and health info | None |
search_anime_by_image | Visual similarity search with image | image_data (base64), limit |
find_visually_similar_anime | Visual similarity by anime ID | anime_id , limit (optional) |
search_multimodal_anime | Combined text and image search | query , image_data , text_weight , limit |
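These tools can be exercised from any MCP-capable client. A minimal sketch using the official mcp Python SDK over stdio; the tool name and parameters come from the table above, everything else is illustrative:
import asyncio

from mcp import ClientSession, StdioServerParameters
from mcp.client.stdio import stdio_client

async def main() -> None:
    # Launch the core server in stdio mode, as described under "Running the MCP Server".
    server = StdioServerParameters(command="python", args=["-m", "src.anime_mcp.server"])
    async with stdio_client(server) as (read, write):
        async with ClientSession(read, write) as session:
            await session.initialize()
            # Call the semantic search tool with parameters from the table above.
            result = await session.call_tool("search_anime", {"query": "space opera with mecha", "limit": 5})
            print(result.content)

asyncio.run(main())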
Workflow Tools (Modern Server):
Tool | Description | Parameters |
---|---|---|
discover_anime | Intelligent multi-agent anime discovery | query (string), user_preferences |
get_currently_airing_anime | Real-time broadcast schedules | day_filter , timezone , platforms |
find_similar_anime_workflow | AI-powered similarity analysis | reference_anime , similarity_mode |
search_by_streaming_platform | Platform-specific availability search | platforms (array), content_filters |
Platform-Specific Tools:
Note: MAL and Jikan are now properly separated platforms with distinct APIs and capabilities.
Tool | Description | Platform |
---|---|---|
search_anime_mal | Official MAL API v2 with OAuth2 & field selection | MAL API v2 |
get_anime_mal | Get MAL anime details by ID | MAL API v2 |
get_mal_seasonal_anime | Get seasonal anime from MAL | MAL API v2 |
search_anime_anilist | AniList GraphQL search | AniList |
get_anime_anilist | Get AniList anime details | AniList |
search_anime_kitsu | Kitsu JSON:API search | Kitsu |
get_anime_kitsu | Get Kitsu anime details | Kitsu |
search_streaming_platforms | Search streaming platform availability | Kitsu |
search_anime_jikan | Jikan API v4 with 17+ advanced parameters | Jikan API v4 |
get_anime_jikan | Get Jikan anime details | Jikan API v4 |
get_jikan_seasonal | Get seasonal anime from Jikan | Jikan API v4 |
search_anime_schedule | AnimeSchedule.net search | AnimeSchedule |
get_schedule_data | Get detailed schedule data | AnimeSchedule |
get_currently_airing | Get currently airing anime | AnimeSchedule |
anime_semantic_search | Vector database semantic search | Vector DB |
anime_similar | Vector similarity search | Vector DB |
anime_vector_stats | Vector database statistics | Vector DB |
Cross-Platform Enrichment Tools:
Tool | Description | Purpose |
---|---|---|
compare_anime_ratings_cross_platform | Compare ratings across platforms | Cross-platform analysis |
get_cross_platform_anime_data | Aggregate data from multiple platforms | Data enrichment |
correlate_anime_across_platforms | Find correlations between platforms | Data validation |
get_streaming_availability_multi_platform | Multi-platform streaming info | Streaming discovery |
detect_platform_discrepancies | Detect data inconsistencies | Quality assurance |
MCP Resources
- anime://server/capabilities - Server capabilities and available tools
- anime://platforms/status - Status of all anime platform integrations
- anime://workflow/architecture - LangGraph workflow architecture information
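Within an initialized ClientSession (like the stdio sketch above), these resources can be listed and read by URI. A sketch; the payload format depends on the server's resource handlers:
from pydantic import AnyUrl
from mcp import ClientSession

async def show_capabilities(session: ClientSession) -> None:
    # Assumes an already-initialized ClientSession, as in the stdio client sketch above.
    resources = await session.list_resources()
    print([r.uri for r in resources.resources])
    capabilities = await session.read_resource(AnyUrl("anime://server/capabilities"))
    print(capabilities.contents[0])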
API Reference
Core System Endpoints
Endpoint | Method | Purpose | Example |
---|---|---|---|
/ | GET | API overview | curl http://localhost:8000/ |
/health | GET | Health check | curl http://localhost:8000/health |
/stats | GET | Database stats | curl http://localhost:8000/stats |
Unified Search Endpoints
Endpoint | Method | Purpose | Content Type | Parameters |
---|---|---|---|---|
/api/search/ | POST | Unified search (supports both JSON and file uploads) | application/json or multipart/form-data | JSON body or form data: See examples below |
Search Types (Auto-detected based on provided fields):
Search Type | Detection Logic | JSON Parameters | Form Parameters |
---|---|---|---|
Text Search | query only | query (required), limit (1-100, default: 20) | query (form field), limit (form field) |
Similar Search | anime_id only | anime_id (required), limit (1-100, default: 20) | anime_id (form field), limit (form field) |
Image Search | image_data or image file | image_data (base64), limit (1-100, default: 20) | image (file upload), limit (form field) |
Visual Similarity | anime_id + visual_similarity=true | anime_id , visual_similarity=true , limit | anime_id , visual_similarity=true , limit |
Multimodal | query + image_data /image file | query , image_data , text_weight (0.0-1.0), limit | query , image (file), text_weight , limit |
JSON Search Examples (using /api/search/):
# Text search (query only)
curl -X POST http://localhost:8000/api/search/ \
-H "Content-Type: application/json" \
-d '{
"query": "action adventure anime",
"limit": 5
}'
# Similar search (anime_id only)
curl -X POST http://localhost:8000/api/search/ \
-H "Content-Type: application/json" \
-d '{
"anime_id": "cac1eeaeddf7",
"limit": 5
}'
# Image search with base64 data
curl -X POST http://localhost:8000/api/search/ \
-H "Content-Type: application/json" \
-d '{
"image_data": "iVBORw0KGgoAAAANSUhEUgAAAAEAAAAB...",
"limit": 5
}'
# Visual similarity search (anime_id + visual_similarity flag)
curl -X POST http://localhost:8000/api/search/ \
-H "Content-Type: application/json" \
-d '{
"anime_id": "cac1eeaeddf7",
"visual_similarity": true,
"limit": 5
}'
# Multimodal search (query + image_data)
curl -X POST http://localhost:8000/api/search/ \
-H "Content-Type: application/json" \
-d '{
"query": "mecha anime",
"image_data": "iVBORw0KGgoAAAANSUhEUgAAAAEAAAAB...",
"text_weight": 0.7,
"limit": 10
}'
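The truncated image_data strings above are plain base64-encoded image bytes. A minimal helper sketch for producing and sending them, using httpx (already part of the project's test stack); the file name is just an example:
import base64
import httpx

with open("anime_poster.jpg", "rb") as f:  # any local poster or screenshot
    image_data = base64.b64encode(f.read()).decode("utf-8")

resp = httpx.post(
    "http://localhost:8000/api/search/",
    json={"image_data": image_data, "limit": 5},
    timeout=60.0,
)
resp.raise_for_status()
print(resp.json()["total_results"])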
File Upload Examples (using /api/search/ with form data):
# Text search with form data
curl -X POST http://localhost:8000/api/search/ \
-F "query=action adventure anime" \
-F "limit=5"
# Similar search with form data
curl -X POST http://localhost:8000/api/search/ \
-F "anime_id=cac1eeaeddf7" \
-F "limit=5"
# Image search with file upload
curl -X POST http://localhost:8000/api/search/ \
-F "image=@anime_poster.jpg" \
-F "limit=5"
# Visual similarity search with form data
curl -X POST http://localhost:8000/api/search/ \
-F "anime_id=cac1eeaeddf7" \
-F "visual_similarity=true" \
-F "limit=5"
# Multimodal search with file upload
curl -X POST http://localhost:8000/api/search/ \
-F "query=mecha anime" \
-F "image=@robot_poster.jpg" \
-F "text_weight=0.7" \
-F "limit=10"
Advanced Search Examples:
# Search by genre
curl -X POST http://localhost:8000/api/search/ \
-H "Content-Type: application/json" \
-d '{"query": "action adventure", "limit": 5}'
# Search by studio
curl -X POST http://localhost:8000/api/search/ \
-H "Content-Type: application/json" \
-d '{"query": "studio ghibli", "limit": 3}'
# Search by theme
curl -X POST http://localhost:8000/api/search/ \
-H "Content-Type: application/json" \
-d '{"query": "romantic comedy", "limit": 5}'
# Complex semantic search
curl -X POST http://localhost:8000/api/search/ \
-H "Content-Type: application/json" \
-d '{
"query": "mecha robots fighting in space",
"limit": 10
}'
Conversational Workflow Endpoints
Endpoint | Method | Purpose | Example |
---|---|---|---|
/api/workflow/conversation | POST | Start/continue conversation | Standard conversation flows |
/api/workflow/smart-conversation | POST | Smart orchestration workflow | Advanced multi-step query processing |
/api/workflow/multimodal | POST | Multimodal conversation | Text + image conversation |
/api/workflow/conversation/{id} | GET | Get conversation history | Retrieve session with summary |
/api/workflow/conversation/{id} | DELETE | Delete conversation | Remove conversation session |
/api/workflow/stats | GET | Workflow statistics | Get conversation metrics |
/api/workflow/health | GET | Workflow system health | Check LangGraph engine status |
AI-Powered Query Understanding:
The system now features intelligent natural language processing that automatically extracts search parameters from user queries:
# AI-powered natural language understanding
curl -X POST http://localhost:8000/api/workflow/smart-conversation \
-H "Content-Type: application/json" \
-d '{"message": "find me 5 mecha anime from 2020s but not too violent"}'
# Response includes extracted parameters:
# {
# "current_context": {
# "query": "mecha anime", # Cleaned query
# "limit": 5, # Extracted from "find me 5"
# "filters": {
# "year_range": [2020, 2029], # From "2020s"
# "genres": ["mecha"], # Detected genre
# "exclusions": ["violent"] # From "but not too violent"
# }
# }
# }
Conversational Workflow Examples:
# Start standard conversation
curl -X POST http://localhost:8000/api/workflow/conversation \
-H "Content-Type: application/json" \
-d '{"message": "Find me some good action anime"}'
# Smart orchestration for complex queries
curl -X POST http://localhost:8000/api/workflow/smart-conversation \
-H "Content-Type: application/json" \
-d '{
"message": "find action anime but not horror and similar to attack on titan",
"enable_smart_orchestration": true,
"max_discovery_depth": 3
}'
# Continue existing conversation
curl -X POST http://localhost:8000/api/workflow/conversation \
-H "Content-Type: application/json" \
-d '{
"message": "Find something similar but more romantic",
"session_id": "existing-session-id"
}'
# Multimodal conversation with image
curl -X POST http://localhost:8000/api/workflow/multimodal \
-H "Content-Type: application/json" \
-d '{
"message": "Find anime similar to this image",
"image_data": "base64_encoded_image_data",
"text_weight": 0.7
}'
# Get conversation history
curl http://localhost:8000/api/workflow/conversation/session-id
# Check workflow system health
curl http://localhost:8000/api/workflow/health
Admin Endpoints
Endpoint | Method | Purpose | Example |
---|---|---|---|
/api/admin/check-updates | POST | Check for updates | curl -X POST http://localhost:8000/api/admin/check-updates |
/api/admin/update-incremental | POST | Perform incremental update | curl -X POST http://localhost:8000/api/admin/update-incremental |
/api/admin/update-full | POST | Perform full database update | curl -X POST http://localhost:8000/api/admin/update-full |
/api/admin/update-status | GET | Get update status | curl http://localhost:8000/api/admin/update-status |
/api/admin/schedule-weekly-update | POST | Schedule weekly updates | curl -X POST http://localhost:8000/api/admin/schedule-weekly-update |
/api/admin/smart-schedule-analysis | GET | Get smart schedule analysis | curl http://localhost:8000/api/admin/smart-schedule-analysis |
/api/admin/update-safety-check | GET | Check update safety | curl http://localhost:8000/api/admin/update-safety-check |
/api/admin/smart-update | POST | Perform smart update | curl -X POST http://localhost:8000/api/admin/smart-update |
Response Formats
Standard Search Response
{
"query": "dragon ball",
"results": [
{
"anime_id": "cac1eeaeddf7",
"title": "Dragon Ball Z",
"synopsis": "Description text",
"type": "TV",
"episodes": 291,
"tags": ["action", "adventure", "fighting"],
"studios": ["toei animation co., ltd."],
"picture": "https://cdn.myanimelist.net/images/anime/1277/142022.jpg",
"score": 0.7822759,
"year": 1989,
"season": "spring",
"myanimelist_id": 813,
"anilist_id": 813
}
],
"total_results": 1,
"processing_time_ms": 45.2
}
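For programmatic use, the same response can be consumed directly. A minimal httpx sketch that reads the fields shown above:
import httpx

resp = httpx.post(
    "http://localhost:8000/api/search/",
    json={"query": "dragon ball", "limit": 5},
    timeout=30.0,
)
resp.raise_for_status()
data = resp.json()
print(f"{data['total_results']} results in {data['processing_time_ms']} ms")
for hit in data["results"]:
    print(f"{hit['score']:.3f}  {hit['title']} ({hit['year']})")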
Workflow Response Structure
{
"session_id": "string - Session identifier",
"messages": "array - Conversation history",
"workflow_steps": "array - Executed workflow steps",
"current_context": {
"query": "string - Processed query",
"limit": "integer - Result limit",
"filters": "object - Applied filters",
"results": "array[AnimeResult] - Search results"
},
"user_preferences": "object - Learned user preferences"
}
Error Response Structure
{
"error": "string - Error message",
"detail": "string - Detailed error information",
"status_code": "integer - HTTP status code"
}
API Constraints & Limits
- Search Limit: 1-50 results per request
- Image Size: Max 10MB for image uploads
- Session Timeout: 1 hour of inactivity
- Query Length: Max 500 characters
- Concurrent Requests: 10 per client
- Text Weight: 0.0-1.0 (multimodal searches)
Query Filters & AI Understanding
AI-Extracted Filter Patterns
The system automatically extracts these filters from natural language:
{
"filters": {
"year_range": [2020, 2029], // From "2020s"
"year": 2019, // From "2019"
"genres": ["mecha", "action"], // From "mecha action anime"
"exclusions": ["horror", "violent"], // From "but not horror or violent"
"studios": ["Studio Ghibli"], // From "Studio Ghibli movies"
"anime_types": ["Movie"], // From "movies" or "films"
"mood": ["light", "funny"] // From "light-hearted" or "funny"
}
}
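Conceptually, this kind of extraction maps free text onto a typed filter object via an LLM with structured output. The sketch below is illustrative only (it is not the repo's llm_service implementation); it assumes the openai Python SDK, the GPT-4o-mini model mentioned under Technology Stack, and a simplified schema:
from pydantic import BaseModel
from openai import OpenAI

class ExtractedFilters(BaseModel):
    query: str                    # cleaned search query
    limit: int | None             # e.g. "find me 5 ..." -> 5
    year_range: list[int] | None  # e.g. "2020s" -> [2020, 2029]
    genres: list[str]
    exclusions: list[str]         # e.g. "but not too violent" -> ["violent"]

client = OpenAI()  # requires OPENAI_API_KEY
completion = client.beta.chat.completions.parse(
    model="gpt-4o-mini",
    messages=[
        {"role": "system", "content": "Extract anime search parameters from the user's request."},
        {"role": "user", "content": "find me 5 mecha anime from 2020s but not too violent"},
    ],
    response_format=ExtractedFilters,
)
print(completion.choices[0].message.parsed)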
Manual Filter Syntax
For direct API calls, use these patterns:
- "mecha anime 2020s -horror" → mecha from the 2020s, exclude horror
- "Studio Ghibli movies" → Studio Ghibli movies only
- "action adventure TV series" → action adventure TV series
Testing
Postman Collection
A comprehensive Postman collection is available with all 76 requests organized by category:
Import the collection:
- Collection: postman/Anime_MCP_Server_Complete.postman_collection.json
- Environment: postman/Anime_MCP_Server_Local.postman_environment.json
Collection Structure:
- Core System (3 requests): Health, stats, API overview
- Unified Search (8 requests): Text, similar, image, visual similarity, multimodal, file upload variations
- Query (2 requests): AI-powered query processing
- Admin (8 requests): Database management endpoints
- External Platforms (55 requests): AniList, MAL, Kitsu, AnimeSchedule, AniDB, AnimePlanet, AniSearch, AnimeCountdown
Environment Variables:
- baseUrl: http://localhost:8000
- anime_id: cac1eeaeddf7 (example anime ID)
- staff_id: 95 (example staff ID)
- studio_id: 11 (example studio ID)
FastAPI Server Testing
# Health check
curl http://localhost:8000/health
# Test search
curl -X POST http://localhost:8000/api/search/ \
-H "Content-Type: application/json" \
-d '{"query": "dragon ball", "limit": 5}'
# Stats
curl http://localhost:8000/stats
API Testing Sequences
Basic API Testing Sequence
- Health Check → Database Stats → Simple Search → Semantic Search
# 1. Health check
curl http://localhost:8000/health
# 2. Database stats
curl http://localhost:8000/stats
# 3. Simple search
curl -X POST http://localhost:8000/api/search/ \
-H "Content-Type: application/json" \
-d '{"query": "dragon ball", "limit": 5}'
# 4. Semantic search
curl -X POST http://localhost:8000/api/search/ \
-H "Content-Type: application/json" \
-d '{"query": "mecha robots fighting in space", "limit": 10}'
AI Query Understanding Testing
- Smart Conversation with limit extraction
- Complex query with multiple filters
- Studio + year + exclusion query
- Verify extracted parameters in response
# Test natural language parameter extraction
curl -X POST http://localhost:8000/api/workflow/smart-conversation \
-H "Content-Type: application/json" \
-d '{"message": "find me 5 mecha anime from 2020s but not too violent"}'
# Test studio + year extraction
curl -X POST http://localhost:8000/api/workflow/smart-conversation \
-H "Content-Type: application/json" \
-d '{"message": "show me top 3 Studio Ghibli movies from 90s"}'
# Test complex exclusions
curl -X POST http://localhost:8000/api/workflow/smart-conversation \
-H "Content-Type: application/json" \
-d '{"message": "find action adventure anime but not romance or horror"}'
Multimodal Testing Sequence
- Base64 image search
- Multimodal conversation with text + image
- Visual similarity search
- Verify image and text weights
# Image search with base64 data
curl -X POST http://localhost:8000/api/search/ \
-H "Content-Type: application/json" \
-d '{"image_data": "iVBORw0KGgoAAAANSUhEUgAAAAEAAAAB...", "limit": 5}'
# Visual similarity search
curl -X POST http://localhost:8000/api/search/ \
-H "Content-Type: application/json" \
-d '{"anime_id": "cac1eeaeddf7", "visual_similarity": true, "limit": 5}'
# Multimodal workflow
curl -X POST http://localhost:8000/api/workflow/multimodal \
-H "Content-Type: application/json" \
-d '{"message": "find anime similar to this style", "image_data": "base64_data", "text_weight": 0.7}'
Workflow Testing Sequence
- Standard conversation
- Smart orchestration with complex query
- Multimodal workflow
- Session management (create, retrieve, delete)
# Standard conversation
curl -X POST http://localhost:8000/api/workflow/conversation \
-H "Content-Type: application/json" \
-d '{"message": "Find me some good action anime"}'
# Smart orchestration with complex query
curl -X POST http://localhost:8000/api/workflow/smart-conversation \
-H "Content-Type: application/json" \
-d '{
"message": "find mecha anime but not too violent and similar to gundam",
"enable_smart_orchestration": true,
"max_discovery_depth": 3
}'
# Session management
curl http://localhost:8000/api/workflow/conversation/session-id # Get history
curl -X DELETE http://localhost:8000/api/workflow/conversation/session-id # Delete session
MCP Server Testing
Comprehensive Testing
# Test MCP server (comprehensive verification)
python scripts/test_mcp_server_comprehensive.py
# With detailed output and image testing
python scripts/test_mcp_server_comprehensive.py --detailed
# Skip image tests (if CLIP not available)
python scripts/test_mcp_server_comprehensive.py --skip-image-tests
# Expected output:
# Starting comprehensive FastMCP Anime Server verification...
# MCP session initialized
# Available tools: ['search_anime', 'get_anime_details', 'find_similar_anime', 'get_anime_stats', 'search_anime_by_image', 'find_visually_similar_anime', 'search_multimodal_anime']
# All expected MCP tools are available
# Basic search test successful
# Stats test successful
# Testing image search functionality...
# Image search returned 3 results
# Visual similarity search returned 2 results
# Multimodal search returned 3 results
# Testing database health and statistics...
# Qdrant connection verified
# Total anime entries: 38,894
# All tests completed successfully!
Protocol-Specific Testing
stdio mode (local development):
# Start MCP server in one terminal
source venv/bin/activate
python -m src.anime_mcp.modern_server
# Use the automated test script (recommended approach)
python scripts/test_mcp_server_comprehensive.py
SSE mode (web/remote access):
# Start SSE MCP server
python -m src.anime_mcp.modern_server --mode sse --port 8001
# Test SSE endpoint accessibility
curl http://localhost:8001/sse/
# Test with compatible MCP clients
# Endpoint: http://localhost:8001/sse/
Unit Tests
# Run full test suite
python run_tests.py
# Run specific test categories
pytest tests/unit/ -v
pytest tests/integration/ -v
# Test AI-powered query understanding
pytest tests/unit/services/test_llm_service.py -v
pytest tests/unit/langgraph/test_llm_integration.py -v
# Test smart orchestration features
pytest tests/unit/langgraph/test_smart_orchestration.py -v
# Run tests with coverage
pytest tests/ --cov=src --cov-report=html
Configuration
Environment Variables
# Qdrant Configuration
QDRANT_URL=http://localhost:6333 # Vector database URL
QDRANT_COLLECTION_NAME=anime_database # Collection name
# Server Configuration
HOST=0.0.0.0 # FastAPI bind host
PORT=8000 # FastAPI port
DEBUG=True # Debug mode
# Vector Search Configuration
FASTEMBED_MODEL=BAAI/bge-small-en-v1.5 # Embedding model
QDRANT_VECTOR_SIZE=384 # Vector dimensions
QDRANT_DISTANCE_METRIC=cosine # Distance function
# Multi-Vector Configuration (Image Search)
ENABLE_MULTI_VECTOR=true # Enable image search features
IMAGE_VECTOR_SIZE=512 # CLIP embedding dimensions
CLIP_MODEL=ViT-B/32 # CLIP model for image processing
# API Configuration
API_TITLE=Anime MCP Server # API title
API_VERSION=1.0.0 # API version
ALLOWED_ORIGINS=* # CORS origins
# LLM Configuration (AI-Powered Query Understanding)
OPENAI_API_KEY=your_openai_key_here # OpenAI API key for intelligent query parsing
ANTHROPIC_API_KEY=your_anthropic_key_here # Anthropic API key (alternative)
LLM_PROVIDER=openai # Default LLM provider: openai, anthropic
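These variables are read through the centralized configuration in src/config.py. A hedged sketch of how such a settings object typically looks with pydantic-settings; field names mirror the variables above, but the real module may differ:
from pydantic_settings import BaseSettings, SettingsConfigDict

class Settings(BaseSettings):
    model_config = SettingsConfigDict(env_file=".env", extra="ignore")

    # Qdrant / server
    qdrant_url: str = "http://localhost:6333"
    qdrant_collection_name: str = "anime_database"
    host: str = "0.0.0.0"
    port: int = 8000
    debug: bool = True

    # Vector search
    fastembed_model: str = "BAAI/bge-small-en-v1.5"
    qdrant_vector_size: int = 384
    qdrant_distance_metric: str = "cosine"
    enable_multi_vector: bool = True
    image_vector_size: int = 512
    clip_model: str = "ViT-B/32"

    # LLM providers
    openai_api_key: str | None = None
    anthropic_api_key: str | None = None
    llm_provider: str = "openai"

settings = Settings()  # environment variables and .env override the defaults above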
Docker Configuration
The system uses Docker Compose for orchestration with support for both REST API and MCP server:
Port Allocation:
- 8000: FastAPI REST API
- 8001: MCP HTTP Server (when enabled)
- 6333: Qdrant Vector Database
- 6334: Qdrant gRPC (internal)
Performance
- Search Speed: Sub-200ms text search, ~1s image search response times
- Workflow Performance: 150ms target response time (improved from 200ms via ToolNode optimization)
- AI Query Understanding: ~500ms LLM response time with structured output parsing
- Smart Orchestration: 50ms average response time (faster than standard workflows)
- Vector Models:
- Text: BAAI/bge-small-en-v1.5 (384-dimensional embeddings)
- Image: CLIP ViT-B/32 (512-dimensional embeddings)
- LLM: OpenAI GPT-4o-mini / Anthropic Claude Haiku for query understanding
- Database Size: 38,894 anime entries with dual image vectors (picture + thumbnail)
- Memory Usage: ~3-4GB for full dataset with CLIP image embeddings
- Indexing Time: 2-3 hours for complete database with image downloads
- Vector Storage: Text (384D) + Picture (512D) + Thumbnail (512D) per anime
- Concurrency: Supports multiple simultaneous searches and complex query processing
- MCP Protocol: Full FastMCP 2.8.1 integration with 8 core tools + 4 workflow tools + 14 platform tools + 5 enrichment tools (31 total)
- Workflow Processing: 2-5 workflow steps per query depending on complexity
- Natural Language Processing: Intelligent parameter extraction with graceful fallbacks
Data Pipeline
Source Data
- Provider: anime-offline-database
- Format: Comprehensive JSON with cross-platform references
- Coverage: MyAnimeList, AniList, Kitsu, AniDB, and 7 other sources
- Total Entries: 38,894 anime with rich metadata
Processing Steps
- Download: Fetch latest anime-offline-database JSON
- Validation: Parse and validate entries with Pydantic models
- Enhancement: Extract platform IDs, calculate quality scores
- Text Vectorization: Create embeddings from title + synopsis + tags + studios
- Image Processing: Download poster images and generate CLIP embeddings
- Indexing: Store in Qdrant multi-vector collection with optimized batch processing (condensed sketch below)
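A condensed sketch of steps 4 and 6 (text vectorization and indexing), assuming the fastembed and qdrant-client packages and a simplified single-vector collection; the real pipeline uses the multi-vector collection with additional CLIP image vectors:
from fastembed import TextEmbedding
from qdrant_client import QdrantClient
from qdrant_client.models import Distance, PointStruct, VectorParams

embedder = TextEmbedding(model_name="BAAI/bge-small-en-v1.5")  # 384-dim text model
client = QdrantClient(url="http://localhost:6333")

client.create_collection(
    collection_name="anime_database_demo",  # demo collection name (assumption)
    vectors_config=VectorParams(size=384, distance=Distance.COSINE),
)

anime = {
    "title": "Example Anime",
    "synopsis": "A short description.",
    "tags": ["action", "adventure"],
    "studios": ["example studio"],
}
# Step 4: build embedding text from title + synopsis + tags + studios
embedding_text = " ".join([anime["title"], anime["synopsis"], *anime["tags"], *anime["studios"]])
vector = next(iter(embedder.embed([embedding_text])))

# Step 6: upsert into Qdrant with the metadata as payload
client.upsert(
    collection_name="anime_database_demo",
    points=[PointStruct(id=1, vector=vector.tolist(), payload=anime)],
)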
Database Schema
Each anime entry contains:
- Basic Info: title, type, episodes, status, year, season
- Metadata: synopsis, tags, studios, producers, picture URLs
- Platform IDs: MyAnimeList, AniList, Kitsu, AniDB, etc.
- Search Fields: embedding_text, search_text
- Vector Embeddings: text (384-dim) + image (512-dim) in multi-vector collection
- Quality Score: Data completeness rating (0-1); an illustrative model sketch follows
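An illustrative Pydantic v2 sketch of an entry with the fields above; the actual model lives in src/models/anime.py and its exact field names and types may differ:
from typing import Optional
from pydantic import BaseModel, Field

class AnimeEntry(BaseModel):
    # Basic info
    anime_id: str
    title: str
    type: str = "TV"
    episodes: int = 0
    status: Optional[str] = None
    year: Optional[int] = None
    season: Optional[str] = None
    # Metadata
    synopsis: Optional[str] = None
    tags: list[str] = Field(default_factory=list)
    studios: list[str] = Field(default_factory=list)
    producers: list[str] = Field(default_factory=list)
    picture: Optional[str] = None
    # Platform IDs
    myanimelist_id: Optional[int] = None
    anilist_id: Optional[int] = None
    kitsu_id: Optional[int] = None
    anidb_id: Optional[int] = None
    # Search fields
    embedding_text: Optional[str] = None
    search_text: Optional[str] = None
    # Data completeness rating (0-1)
    quality_score: float = 0.0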
Interactive Testing
Web Interface
Visit http://localhost:8000/docs for:
- FastAPI automatic documentation (Swagger UI)
- Interactive API testing
- Real-time endpoint exploration
MCP Testing
# Test MCP server communication
python scripts/test_mcp.py
# Start MCP server for AI integration
python -m src.anime_mcp.server
Development
Important Scripts
# MCP Server Management
python -m src.anime_mcp.modern_server # Start MCP server (stdio mode)
python -m src.anime_mcp.modern_server --mode sse --port 8001 # Start MCP server (SSE mode)
python -m src.anime_mcp.modern_server --help # View all CLI options
# Data Management
python scripts/migrate_to_multivector.py --dry-run # Test collection migration
python scripts/migrate_to_multivector.py # Migrate to multi-vector
python scripts/add_image_embeddings.py --batch-size 100 # Process image embeddings
# Testing & Verification
python scripts/test_mcp_server_comprehensive.py # Comprehensive MCP server testing
python scripts/test_mcp_server_comprehensive.py --detailed # Detailed testing with sample data
python scripts/test_mcp_server_comprehensive.py --skip-image-tests # Skip image tests if CLIP unavailable
python run_tests.py # Run full test suite
# Data Pipeline
curl -X POST http://localhost:8000/api/admin/update-full # Full database update
curl -X POST http://localhost:8000/api/admin/update-incremental # Incremental update
Code Quality & Formatting
# Code formatting and linting (recommended order)
autoflake --recursive --in-place --remove-all-unused-imports --remove-unused-variables src/ tests/ scripts/
isort src/ tests/ scripts/
black src/ tests/ scripts/
# Check formatting (CI/pre-commit)
autoflake --check --recursive --remove-all-unused-imports src/ tests/ scripts/
isort --check-only src/ tests/ scripts/
black --check src/ tests/ scripts/
# Type checking
mypy src/
# Run tests
pytest tests/ -v
Formatting Tools Configuration:
- Black: Code style formatting (88 char line length, Python 3.11+ target)
- isort: Import organization (Black-compatible profile)
- autoflake: Unused import removal (safe settings, preserves __init__.py imports)
All tools configured in pyproject.toml with modern best practices and compatibility.
Project Structure
- src/main.py: FastAPI application entry point
- src/anime_mcp/server.py: Core FastMCP server with 8 tools + 2 resources
- src/anime_mcp/modern_server.py: Modern MCP server with LangGraph workflows
- src/vector/qdrant_client.py: Multi-vector database operations with CLIP
- src/vector/vision_processor.py: CLIP image processing pipeline
- src/config.py: Centralized configuration management
- src/services/llm_service.py: AI-powered query understanding service
- scripts/test_mcp.py: MCP server testing client
Technology Stack
- Backend: FastAPI + Python 3.11+
- Vector Database: Qdrant 1.11.3 (multi-vector support)
- Text Embeddings: FastEmbed (BAAI/bge-small-en-v1.5)
- Image Embeddings: CLIP (ViT-B/32)
- AI Integration: OpenAI GPT-4o-mini / Anthropic Claude for query understanding
- Workflow Engine: LangGraph with native ToolNode integration for conversation orchestration
- MCP Integration: FastMCP 2.8.1
- Image Processing: PIL + torch + CLIP
- Containerization: Docker + Docker Compose
- Data Validation: Pydantic v2
- Testing: pytest + httpx
License
This project is licensed under the MIT License - see the LICENSE file for details.
Acknowledgments
- anime-offline-database for comprehensive anime data
- Qdrant for vector search capabilities
- FastAPI for the web framework
- FastMCP for MCP protocol integration
Troubleshooting
Common Issues
Port Conflicts
# Check if port is in use
netstat -tulpn | grep :8001
# Use different port
python -m src.anime_mcp.modern_server --mode sse --port 8002
MCP Connection Issues
# Check server logs with verbose output
python -m src.anime_mcp.modern_server --mode sse --verbose
# Verify Qdrant connection
curl http://localhost:6333/health
Docker Issues
# Check container logs
docker compose logs fastapi --tail 50
docker compose logs mcp-server --tail 50
docker compose logs qdrant --tail 50
# Monitor real-time logs (useful during indexing)
docker compose logs fastapi -f
# Check container status
docker compose ps
# Restart specific service
docker compose restart fastapi
docker compose restart mcp-server
# Rebuild containers after code changes
docker compose build --no-cache
docker compose up -d --force-recreate
Indexing Issues
# Check indexing progress
curl http://localhost:8000/api/admin/update-status
# Monitor indexing in real-time
docker compose logs fastapi -f | grep "batch"
# Check database statistics
curl http://localhost:8000/stats
# Restart indexing if it fails
curl -X POST http://localhost:8000/api/admin/update-full
Invalid Transport Mode
# Core server valid modes: stdio, http, sse, streamable
python -m src.anime_mcp.server --mode invalid
# Error: argument --mode: invalid choice: 'invalid'
# Modern server valid modes: stdio, sse
python -m src.anime_mcp.modern_server --mode invalid
# Error: argument --mode: invalid choice: 'invalid'
Support
- Issues: Report bugs via GitHub Issues
- Discussions: Community discussions in GitHub Discussions
- Documentation: Full docs at the /docs endpoint when the server is running
- Development: See CLAUDE.md for detailed development guidance
Status: Production Ready - Complete anime search system with multi-modal capabilities, vector database, FastMCP integration, and comprehensive REST API.