soundside.ai
MCP-native AI media generation with x402 pay-per-call. Image, video, audio, and music from 6 providers — composable via resource IDs. USDC on Base.
Soundside — Developer Documentation
AI Media Production Platform for Agents
Soundside exposes 15 MCP tools for generating, editing, composing, extracting, and analyzing media — images, video, audio, music, text, and business artifacts. Connect any MCP client. Pay with an API key (credits) or crypto (x402 USDC on Base, no account needed).
Quick Start
# MCP endpoint
https://mcp.soundside.ai/mcp
# Auth: API key or x402 crypto payment
Authorization: Bearer <your-api-key>
POST https://mcp.soundside.ai/mcp
{"jsonrpc":"2.0","id":"1","method":"tools/list","params":{}}
Tools (15)
Generation
| Tool | What It Does | Providers |
|---|---|---|
create_image | Text-to-image, character references | Vertex AI, Grok, Runway, MiniMax, Luma |
create_video | Text-to-video, image-to-video, video extension | Vertex AI (Veo 3.1), Runway, MiniMax, Luma, Grok |
create_audio | TTS, deprecated transcribe compatibility shim, voice cloning, sound effects, voice design | MiniMax, Vertex AI, Runway, Creative Freedom |
create_music | Music from lyrics and style prompts | MiniMax, Creative Freedom |
create_text | LLM chat completions, structured output | Vertex AI (Gemini), Grok, MiniMax |
create_artifact | Charts, presentations, documents, diagrams; bundle mode for multi-artifact packages from a single brief | Plotly, PPTX, WeasyPrint, Mermaid, Gamma |
Editing & Analysis
| Tool | What It Does |
|---|---|
edit_video | Core video transforms: trim, concat, crossfade, speed, loop, color grade, burn subtitles, custom FFmpeg |
edit_audio | Mix, replace, or pad audio on existing media |
compose_media | Add text, overlay media, or build split-screen composites |
apply_effect | Ken Burns, speed ramp, film grain, vignette |
extract_media | Extract frames, frame sets, or audio tracks |
analyze_media | Technical analysis, AI vision QA, transcription, segment detection, and EDL export |
Library Management
| Tool | What It Does |
|---|---|
lib_list | Browse projects, collections, resources, lineage, brand kits; query credit balance |
lib_manage | CRUD for projects, collections, resources, brand kits |
lib_share | Share projects with other users by email |
Pricing Philosophy
Soundside aims to break even on provider pass-through costs with a small margin (~10%). The real value is in the editing engine, library management, and self-hosted models — those are priced at $0.01/call ($0.03 for analyze_media vision_qa).
Live pricing is always available at:
GET https://mcp.soundside.ai/api/x402/status
This returns machine-readable per-tool, per-provider USDC prices. Prices are DB-driven and may change — always check the endpoint rather than hardcoding.
x402: Pay-Per-Call with Crypto
No API key needed. Pay with USDC on Base (L2) per tool call.
Network: eip155:8453 (Base mainnet)
Token: USDC
Facilitator: Coinbase CDP
See x402 Guide for full setup.
Guides
- Getting Started — First MCP connection in 5 minutes
- x402 Pay-Per-Call — Crypto payments, no account needed
- Tool Reference — Detailed docs for all 15 tools
Examples
- Python — API Key — Connect and generate with httpx
- Python — x402 — Pay-per-call with USDC
- TypeScript — API Key — Node.js MCP client
- OpenClaw Skill — One-line config for OpenClaw agents
Links
- Website: soundside.ai
- MCP Endpoint:
https://mcp.soundside.ai/mcp - Live Pricing:
https://mcp.soundside.ai/api/x402/status - GitHub: github.com/soundside-design/soundside-docs
เซิร์ฟเวอร์ที่เกี่ยวข้อง
Solana Agent Kit MCP Server
Interact with the Solana blockchain using the Solana Agent Kit.
Adobe AEM
Import a single webpage from any URL to structured HTML content for authoring in AEM Edge Delivery Services. Scrapes the page, analyzes structure, maps to existing blocks, and generates HTML for immediate local preview.
S2T Accelerators
36 enterprise MCP tools for AWS security, infrastructure generation, AI workflows, and AI agent governance.
YouTube
An MCP server for interacting with YouTube's data and services.
Spotify MCP Server
Interact with the Spotify API to manage playlists and control music playback.
Remote MCP Server on Cloudflare
An MCP server designed to run on Cloudflare Workers, featuring OAuth login support.
Google Cloud Healthcare API (FHIR)
Provides healthcare tools for interacting with FHIR resources on Google Cloud Healthcare API and public medical research APIs like PubMed.
Azure Pricing MCP Server
Query Azure retail pricing information using the Azure Retail Prices API.
Litmus MCP Server
Enables LLMs and intelligent systems to interact with Litmus Edge for device configuration, monitoring, and management.
Salesforce MCP Server
Integrates Claude with Salesforce, enabling natural language interactions with your Salesforce data and metadata.