Simple Files Vectorstore
Provides semantic search across local files by creating vector embeddings from watched directories.
@lishenxydlgzs/simple-files-vectorstore
A Model Context Protocol (MCP) server that provides semantic search capabilities across files. This server watches specified directories and creates vector embeddings of file contents, enabling semantic search across your documents.
Installation & Usage
Add to your MCP settings file:
{
"mcpServers": {
"files-vectorstore": {
"command": "npx",
"args": [
"-y",
"@lishenxydlgzs/simple-files-vectorstore"
],
"env": {
"WATCH_DIRECTORIES": "/path/to/your/directories"
},
"disabled": false,
"autoApprove": []
}
}
}
MCP settings file locations:
- VSCode Cline Extension:
~/Library/Application Support/Code/User/globalStorage/saoudrizwan.claude-dev/settings/cline_mcp_settings.json - Claude Desktop App:
~/Library/Application Support/Claude/claude_desktop_config.json
Configuration
The server requires configuration through environment variables:
Required Environment Variables
You must specify directories to watch using ONE of the following methods:
WATCH_DIRECTORIES: Comma-separated list of directories to watchWATCH_CONFIG_FILE: Path to a JSON configuration file with awatchListarray
Example using WATCH_DIRECTORIES:
{
"mcpServers": {
"files-vectorstore": {
"command": "npx",
"args": [
"-y",
"@lishenxydlgzs/simple-files-vectorstore"
],
"env": {
"WATCH_DIRECTORIES": "/path/to/dir1,/path/to/dir2"
},
"disabled": false,
"autoApprove": []
}
}
}
Example using WATCH_CONFIG_FILE:
{
"mcpServers": {
"files-vectorstore": {
"command": "npx",
"args": [
"-y",
"@lishenxydlgzs/simple-files-vectorstore"
],
"env": {
"WATCH_CONFIG_FILE": "/path/to/watch-config.json"
},
"disabled": false,
"autoApprove": []
}
}
}
The watch config file should have the following structure:
{
"watchList": [
"/path/to/dir1",
"/path/to/dir2",
"/path/to/specific/file.txt"
]
}
Optional Environment Variables
CHUNK_SIZE: Size of text chunks for processing (default: 1000)CHUNK_OVERLAP: Overlap between chunks (default: 200)IGNORE_FILE: Path to a .gitignore style file to exclude files/directories based on patterns
Example with all optional parameters:
{
"mcpServers": {
"files-vectorstore": {
"command": "npx",
"args": [
"-y",
"@lishenxydlgzs/simple-files-vectorstore"
],
"env": {
"WATCH_DIRECTORIES": "/path/to/dir1,/path/to/dir2",
"CHUNK_SIZE": "2000",
"CHUNK_OVERLAP": "500",
"IGNORE_FILE": "/path/to/.gitignore"
},
"disabled": false,
"autoApprove": []
}
}
}
MCP Tools
This server provides the following MCP tools:
1. search
Perform semantic search across indexed files.
Parameters:
query(required): The search query stringlimit(optional): Maximum number of results to return (default: 5, max: 20)
Example response:
[
{
"content": "matched text content",
"source": "/path/to/file",
"fileType": "markdown",
"score": 0.85
}
]
2. get_stats
Get statistics about indexed files.
Parameters: None
Example response:
{
"totalDocuments": 42,
"watchedDirectories": ["/path/to/docs"],
"processingFiles": []
}
Features
- Real-time file watching and indexing
- Semantic search using vector embeddings
- Support for multiple file types
- Configurable chunk size and overlap
- Background processing of files
- Automatic handling of file changes and deletions
Repository
Related Servers
Fuel Network & Sway Language
Semantic search for Fuel Network and Sway Language documentation using a local vector database.
Movie Search Tool
A search tool for movies and TV shows that provides AI clients with validated, playable video links and supports real-time notifications.
Google News
Google News search capabilities with automatic topic categorization and multi-language support via SerpAPI integration.
Rakuten Travel
Search for hotels and check their availability using the Rakuten Travel API.
Marginalia Search
A search engine for non-commercial content and hidden gems of the internet.
Azure AI Search
Search content using Azure AI Agent Service and Azure AI Search.
Perigon MCP Server
Official MCP server for the Perigon API, providing access to real-time news and media data.
招投标大数据服务
Provides comprehensive import and export trade data query functions, including trend analysis, product statistics, and geographic distribution.
Sci-Hub MCP Server
Search and access academic papers from Sci-Hub by DOI, title, or keyword.
Algolia Search
A server for searching an Algolia index using the Algolia Search API.