OpenAI Speech-to-Text
Transcribe audio files using OpenAI's Speech-to-Text API.
OpenAI Speech-to-Text transcriptions MCP Server
A MCP server that provides audio transcription capabilities using OpenAI's API.
Installation
Setup
- Clone the repository:
git clone https://github.com/Ichigo3766/audio-transcriber-mcp.git
cd audio-transcriber-mcp
- Install dependencies:
npm install
- Build the server:
npm run build
-
Set up your OpenAI API key in your environment variables.
-
Add the server configuration to your environment:
{
"mcpServers": {
"audio-transcriber": {
"command": "node",
"args": [
"/path/to/audio-transcriber-mcp/build/index.js"
],
"env": {
"OPENAI_API_KEY": "",
"OPENAI_BASE_URL": "", // Optional
"OPENAI_MODEL": "" // Optional
}
}
}
}
Replace /path/to/audio-transcriber-mcp with the actual path where you cloned the repository.
Features
Tools
transcribe_audio- Transcribe audio files using OpenAI's API- Takes filepath as a required parameter
- Optional parameters:
- save_to_file: Boolean to save transcription to a file
- language: ISO-639-1 language code (e.g., "en", "es")
License
This MCP server is licensed under the MIT License. This means you are free to use, modify, and distribute the software, subject to the terms and conditions of the MIT License. For more details, please see the LICENSE file in the project repository.
相關伺服器
Human-in-the-Loop
Allows AI assistants to ask questions to humans via Discord.
MCP Email Service
A service for managing emails across multiple providers.
X (Twitter)
An MCP server for interacting with the X (Twitter) API, requiring developer credentials.
AgentRPC
Connect to any function, any language, across network boundaries using AgentRPC.
Bouyomi-chan MCP Server (Node.js)
A Node.js MCP server for the Japanese text-to-speech software Bouyomi-chan.
mcp-bitrix24
MCP server for Bitrix24 Tasks, Workgroups, and Users. Implements MCP/JSON-RPC over STDIO.
dTelecom STT
Real-time speech-to-text for AI assistants. Transcribe audio files with production-grade accuracy. Pay per use with USDC via x402 — no API keys needed.
aiogram-mcp
MCP server for Telegram bots built with aiogram. 30 tools, 7 resources, 3 prompts — messaging, rich media, moderation, interactive keyboards, real-time event streaming, rate limiting, permissions, and audit logging.
mcp-server-unipile
95-tool MCP server for the Unipile unified API — messaging (LinkedIn, WhatsApp, Instagram, Telegram), email (Gmail, Outlook), calendar, LinkedIn search (people, companies, posts, jobs), profile management, connections, InMail, and webhooks.
Chatterbox TTS
Generates text-to-speech audio with automatic playback using the Chatterbox TTS model.