OpenAI Speech-to-Text
Transcribe audio files using OpenAI's Speech-to-Text API.
OpenAI Speech-to-Text transcriptions MCP Server
A MCP server that provides audio transcription capabilities using OpenAI's API.
Installation
Setup
- Clone the repository:
git clone https://github.com/Ichigo3766/audio-transcriber-mcp.git
cd audio-transcriber-mcp
- Install dependencies:
npm install
- Build the server:
npm run build
-
Set up your OpenAI API key in your environment variables.
-
Add the server configuration to your environment:
{
"mcpServers": {
"audio-transcriber": {
"command": "node",
"args": [
"/path/to/audio-transcriber-mcp/build/index.js"
],
"env": {
"OPENAI_API_KEY": "",
"OPENAI_BASE_URL": "", // Optional
"OPENAI_MODEL": "" // Optional
}
}
}
}
Replace /path/to/audio-transcriber-mcp with the actual path where you cloned the repository.
Features
Tools
transcribe_audio- Transcribe audio files using OpenAI's API- Takes filepath as a required parameter
- Optional parameters:
- save_to_file: Boolean to save transcription to a file
- language: ISO-639-1 language code (e.g., "en", "es")
License
This MCP server is licensed under the MIT License. This means you are free to use, modify, and distribute the software, subject to the terms and conditions of the MIT License. For more details, please see the LICENSE file in the project repository.
Похожие серверы
Whissle MCP Server
Access Whissle API for speech-to-text, diarization, translation, and text summarization.
CData Microsoft Teams MCP Server
A read-only MCP server for querying live Microsoft Teams data, powered by CData.
MCP ChatGPT Proxy
A production-ready MCP server for ChatGPT and o3-pro, featuring caching, cost tracking, and rate limiting.
Brainstorm MCP
Slack for AI agents - a local service where agents can join projects, message each other, and share resources in a structured workspace
Slack Notify
Send notifications to Slack using OAuth bot tokens.
MCP-Pushover Bridge
Enables AI assistants to send push notifications via the Pushover service.
Telegram Notify MCP
Send Telegram notifications from AI agents - text, photos, and documents via Bot API
Advanced TTS MCP Server
A high-quality, feature-rich Text-to-Speech (TTS) server for generating natural and expressive speech with advanced controls.
Instantly
Manage email campaigns and leads using the Instantly.ai v2 API.
FastMail
Interact with FastMail's email, calendar, and contacts via its JMAP API.