Simple Files Vectorstore
Provides semantic search across local files by creating vector embeddings from watched directories.
@lishenxydlgzs/simple-files-vectorstore
A Model Context Protocol (MCP) server that provides semantic search capabilities across files. This server watches specified directories and creates vector embeddings of file contents, enabling semantic search across your documents.
Installation & Usage
Add to your MCP settings file:
{
"mcpServers": {
"files-vectorstore": {
"command": "npx",
"args": [
"-y",
"@lishenxydlgzs/simple-files-vectorstore"
],
"env": {
"WATCH_DIRECTORIES": "/path/to/your/directories"
},
"disabled": false,
"autoApprove": []
}
}
}
MCP settings file locations:
- VSCode Cline Extension:
~/Library/Application Support/Code/User/globalStorage/saoudrizwan.claude-dev/settings/cline_mcp_settings.json - Claude Desktop App:
~/Library/Application Support/Claude/claude_desktop_config.json
Configuration
The server requires configuration through environment variables:
Required Environment Variables
You must specify directories to watch using ONE of the following methods:
WATCH_DIRECTORIES: Comma-separated list of directories to watchWATCH_CONFIG_FILE: Path to a JSON configuration file with awatchListarray
Example using WATCH_DIRECTORIES:
{
"mcpServers": {
"files-vectorstore": {
"command": "npx",
"args": [
"-y",
"@lishenxydlgzs/simple-files-vectorstore"
],
"env": {
"WATCH_DIRECTORIES": "/path/to/dir1,/path/to/dir2"
},
"disabled": false,
"autoApprove": []
}
}
}
Example using WATCH_CONFIG_FILE:
{
"mcpServers": {
"files-vectorstore": {
"command": "npx",
"args": [
"-y",
"@lishenxydlgzs/simple-files-vectorstore"
],
"env": {
"WATCH_CONFIG_FILE": "/path/to/watch-config.json"
},
"disabled": false,
"autoApprove": []
}
}
}
The watch config file should have the following structure:
{
"watchList": [
"/path/to/dir1",
"/path/to/dir2",
"/path/to/specific/file.txt"
]
}
Optional Environment Variables
CHUNK_SIZE: Size of text chunks for processing (default: 1000)CHUNK_OVERLAP: Overlap between chunks (default: 200)IGNORE_FILE: Path to a .gitignore style file to exclude files/directories based on patterns
Example with all optional parameters:
{
"mcpServers": {
"files-vectorstore": {
"command": "npx",
"args": [
"-y",
"@lishenxydlgzs/simple-files-vectorstore"
],
"env": {
"WATCH_DIRECTORIES": "/path/to/dir1,/path/to/dir2",
"CHUNK_SIZE": "2000",
"CHUNK_OVERLAP": "500",
"IGNORE_FILE": "/path/to/.gitignore"
},
"disabled": false,
"autoApprove": []
}
}
}
MCP Tools
This server provides the following MCP tools:
1. search
Perform semantic search across indexed files.
Parameters:
query(required): The search query stringlimit(optional): Maximum number of results to return (default: 5, max: 20)
Example response:
[
{
"content": "matched text content",
"source": "/path/to/file",
"fileType": "markdown",
"score": 0.85
}
]
2. get_stats
Get statistics about indexed files.
Parameters: None
Example response:
{
"totalDocuments": 42,
"watchedDirectories": ["/path/to/docs"],
"processingFiles": []
}
Features
- Real-time file watching and indexing
- Semantic search using vector embeddings
- Support for multiple file types
- Configurable chunk size and overlap
- Background processing of files
- Automatic handling of file changes and deletions
Repository
相关服务器
Stack Overflow
Access Stack Overflow's trusted and verified technical questions and answers.
EntRoute MCP
MCP Server for AI agents to discover and call pay-per-request APIs via EntRoute. Give Claude, Cursor, Windsurf, or any MCP-compatible agent access to 350+ verified x402 API endpoints across 110+ capabilities — DeFi prices, web search, prediction markets, news, and more.
Plex MCP Server
Search your Plex media library. Supports OAuth and static token authentication.
SEC Filings and Earnings Call
The MCP server provides end-to-end workflows for SEC filings and earnings call transcripts—including ticker resolution, document retrieval, OCR, embedding, on-disk resource discovery, and semantic search—exposed via MCP and powered by the same olmOCR and embedding backends as the vLLM backends.
Agently MCP
Discover public A2A agents on the Agently platform using its public API.
MCP Web Search Tool
A server for real-time web search using pluggable providers, powered by the Brave Search API.
Google Search by CData
An MCP server for Google Search provided by CData, which requires an external CData JDBC Driver.
news-aggregator-mcp-server
Multi-source news aggregation for AI agents — RSS/Atom feeds (16 sources), HackerNews, and GDELT global news intelligence in 65+ languages. No API key required.
hackernews
A simple MCP server that brings Hacker News into your AI workflows. It exposes a set of tools to fetch top stories, individual posts with comments, and the latest Ask HN / Show HN discussions — all in a clean, structured format that’s easy for agents
Pollinations Think
Strategic thinking and real-time web search using the Pollinations AI API with DeepSeek and SearchGPT models.