kokoro-tts-mcp-server

Ross Technologies Logo

MCP server for text-to-speech using Kokoro TTS

100% Local • No Python Required • Female Voice Default

📦 NPM Package | 🐙 GitHub Repository | 📚 Documentation

Overview

A production-ready MCP server that provides text-to-speech capabilities using the Kokoro TTS model. Features a default female voice (af_heart) and runs 100% locally using native JavaScript, eliminating Python dependencies.

Built by Ross Technologies
📍 Beer Sheva, Israel | 📧 devops.ross@gmail.com

Features

✅ 100% Local - No external API calls, complete privacy
✅ Native JavaScript - Built with TypeScript and Node.js
✅ SSE & Stdio Support - Multiple transport modes
✅ NPX Ready - Run directly without installation
✅ Female Voice Default - Uses af_heart voice out of the box
✅ Auto Audio Playback - Automatically saves and plays generated audio

Quick Start

# Run with npx (no installation)
npx @ross_tchnologies/kokoro-tts-mcp-server

# Or install globally
npm install -g @ross_tchnologies/kokoro-tts-mcp-server
kokoro-tts-mcp

Installation

NPM

npm install @ross_tchnologies/kokoro-tts-mcp-server

NPX (No Installation)

npx @ross_tchnologies/kokoro-tts-mcp-server

Global Installation

npm install -g @ross_tchnologies/kokoro-tts-mcp-server

Visit Installation Guide for detailed instructions.

MCP Client Configuration

Add to your MCP configuration file (e.g., ~/.cursor/mcp.json):

{
  "mcpServers": {
    "kokoro-tts": {
      "command": "npx",
      "args": ["-y", "@ross_tchnologies/kokoro-tts-mcp-server"]
    }
  }
}

See Configuration Guide for more options.

Usage

Text to Speech Tool

{
  "name": "text_to_speech",
  "arguments": {
    "text": "Hello, world!",
    "voice": "af_heart",  // optional, default
    "speed": 1.0          // optional, default
  }
}

Available Voices: af_heart (default), af_bella, af_sarah, and more.

See API Reference for complete documentation.

Documentation

📚 Full Documentation Wiki

Quick Links:

Requirements

Node.js v18 or higher
No Python or other external dependencies required! 🎉

Troubleshooting

First run: Downloads ~300MB model (one-time, takes 2-5 minutes)
Audio not playing: File is saved - check response message for location
WSL users: Copy audio files to Windows Desktop for playback

See Troubleshooting Guide for detailed solutions.

Contributing

Contributions are welcome! See:

Contributing Guide
Development Guide

Links & Resources

📦 NPM Package - Install from npm
🐙 GitHub Repository - Source code and issues
📚 Documentation Wiki - Complete documentation
🐛 Report Issues - Bug reports and feature requests
💬 Discussions - Community discussions
🔗 Model Context Protocol - Learn about MCP
🔗 Kokoro TTS Model - Underlying TTS engine

License

MIT License - See LICENSE for details.

Made with ❤️ by Ross Technologies

MCP server for text-to-speech using Kokoro TTS

Overview

Features

Quick Start

Installation

NPM

NPX (No Installation)

Global Installation

MCP Client Configuration

Usage

Text to Speech Tool

Documentation

Requirements

Troubleshooting

Contributing

Links & Resources

License

Related Servers

Discord Webhook

Tldv

Just Facebook

Coreflux MQTT MCP Server

RSS MCP Server by CData

LGTM Images

DeepL

Notifications

Machine 2 Machine Protocol

Ntfy