OpenAI Speech-to-Text
Transcribe audio files using OpenAI's Speech-to-Text API.
OpenAI Speech-to-Text transcriptions MCP Server
An MCP server that provides audio transcription capabilities using OpenAI's API.
Installation
Setup
- Clone the repository:
git clone https://github.com/Ichigo3766/audio-transcriber-mcp.git
cd audio-transcriber-mcp
- Install dependencies:
npm install
- Build the server:
npm run build
- Set up your OpenAI API key in your environment variables.
- Add the server configuration to your MCP client's settings:
{
  "mcpServers": {
    "audio-transcriber": {
      "command": "node",
      "args": [
        "/path/to/audio-transcriber-mcp/build/index.js"
      ],
      "env": {
        "OPENAI_API_KEY": "",
        "OPENAI_BASE_URL": "",
        "OPENAI_MODEL": ""
      }
    }
  }
}
Replace /path/to/audio-transcriber-mcp with the actual path where you cloned the repository. OPENAI_BASE_URL and OPENAI_MODEL are optional.
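If you would rather generate this entry than edit it by hand, the configuration above can be sketched in TypeScript as follows. This is only an illustration: `buildServerEntry`, `buildConfig`, and the option names are hypothetical helpers, not part of this repository. Rejecting an empty key and omitting unset optional variables are assumptions about sensible behavior, not something the server is documented to require.

```typescript
// Hypothetical helpers that produce the "audio-transcriber" entry shown above.
interface ServerEntry {
  command: string;
  args: string[];
  env: Record<string, string>;
}

interface TranscriberOptions {
  repoPath: string; // directory where the repository was cloned
  apiKey: string;   // value for OPENAI_API_KEY
  baseUrl?: string; // optional value for OPENAI_BASE_URL
  model?: string;   // optional value for OPENAI_MODEL
}

function buildServerEntry(opts: TranscriberOptions): ServerEntry {
  // Assumption: an empty key is treated as a configuration mistake.
  if (opts.apiKey.trim() === "") {
    throw new Error("OPENAI_API_KEY must not be empty");
  }
  const env: Record<string, string> = { OPENAI_API_KEY: opts.apiKey };
  // Assumption: optional variables are left out when not provided.
  if (opts.baseUrl) env.OPENAI_BASE_URL = opts.baseUrl;
  if (opts.model) env.OPENAI_MODEL = opts.model;

  // Drop trailing slashes so the joined path has exactly one separator.
  const root = opts.repoPath.replace(/\/+$/, "");
  return { command: "node", args: [`${root}/build/index.js`], env };
}

function buildConfig(
  opts: TranscriberOptions
): { mcpServers: { "audio-transcriber": ServerEntry } } {
  return { mcpServers: { "audio-transcriber": buildServerEntry(opts) } };
}
```

Calling `JSON.stringify(buildConfig({ repoPath: "/path/to/audio-transcriber-mcp", apiKey: "<your key>" }), null, 2)` yields text that can be pasted into the client's settings file.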
Features
Tools
- transcribe_audio - Transcribe audio files using OpenAI's API
  - Takes filepath as a required parameter
  - Optional parameters:
    - save_to_file: Boolean to save transcription to a file
    - language: ISO-639-1 language code (e.g., "en", "es")
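The parameters above become the `arguments` object of an MCP `tools/call` request. Below is a minimal TypeScript sketch of that request shape, assuming the JSON-RPC 2.0 framing that MCP uses. The helper `buildTranscribeRequest`, the interfaces, and the two-letter format check on `language` are illustrative assumptions and are not taken from this server's source.

```typescript
// Arguments accepted by transcribe_audio, as listed above.
interface TranscribeAudioArgs {
  filepath: string;       // required: path to the audio file
  save_to_file?: boolean; // optional: save the transcription to a file
  language?: string;      // optional: ISO-639-1 code such as "en" or "es"
}

// Shape of an MCP tools/call request carried over JSON-RPC 2.0.
interface ToolCallRequest {
  jsonrpc: "2.0";
  id: number;
  method: "tools/call";
  params: { name: "transcribe_audio"; arguments: TranscribeAudioArgs };
}

// Hypothetical helper: validates the arguments and wraps them in a request.
function buildTranscribeRequest(
  id: number,
  args: TranscribeAudioArgs
): ToolCallRequest {
  if (args.filepath.trim() === "") {
    throw new Error("filepath is required");
  }
  // Format check only: two lowercase letters. It does not verify that the
  // code is actually assigned in ISO-639-1.
  if (args.language !== undefined && !/^[a-z]{2}$/.test(args.language)) {
    throw new Error(
      `language must be a two-letter ISO-639-1 code, got "${args.language}"`
    );
  }
  return {
    jsonrpc: "2.0",
    id,
    method: "tools/call",
    params: { name: "transcribe_audio", arguments: args },
  };
}
```

In practice an MCP client builds and sends this request for you; the sketch is mainly useful for seeing which fields are required and which are optional.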
License
This MCP server is licensed under the MIT License. This means you are free to use, modify, and distribute the software, subject to the terms and conditions of the MIT License. For more details, please see the LICENSE file in the project repository.