soundside.ai
MCP-native AI media generation with x402 pay-per-call. Image, video, audio, and music from 6 providers — composable via resource IDs. USDC on Base.
Soundside — Developer Documentation
AI Media Production Platform for Agents
Soundside exposes 19 MCP tools for generating, editing, composing, extracting, and analyzing media — images, video, audio, music, text, and business artifacts — plus LoRA adapter fine-tuning and server-side video composition. Connect any MCP client. Pay with an API key (credits) or crypto (x402 USDC on Base, no account needed).
Quick Start
# MCP endpoint
https://mcp.soundside.ai/mcp
# Auth: API key or x402 crypto payment
Authorization: Bearer <your-api-key>
POST https://mcp.soundside.ai/mcp
{"jsonrpc":"2.0","id":"1","method":"tools/list","params":{}}
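The request above can be built with nothing but the Python standard library. This is a minimal sketch, not the official client: the helper names `build_tools_list_request` and `send` are hypothetical, and the `Accept` header is included on the assumption that the endpoint follows the MCP Streamable HTTP transport, which may also require an `initialize` handshake before `tools/list` is accepted.

```python
import json
import urllib.request

MCP_URL = "https://mcp.soundside.ai/mcp"


def build_tools_list_request(api_key: str, request_id: str = "1") -> urllib.request.Request:
    """Build (but do not send) the JSON-RPC tools/list request from Quick Start."""
    payload = {"jsonrpc": "2.0", "id": request_id, "method": "tools/list", "params": {}}
    headers = {
        "Authorization": f"Bearer {api_key}",
        "Content-Type": "application/json",
        # Assumption: Streamable HTTP servers expect both media types here.
        "Accept": "application/json, text/event-stream",
    }
    return urllib.request.Request(
        MCP_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers=headers,
        method="POST",
    )


def send(request: urllib.request.Request, timeout: float = 30.0) -> str:
    """Send the request and return the raw response body as text."""
    with urllib.request.urlopen(request, timeout=timeout) as resp:
        return resp.read().decode("utf-8")
```

Calling `send(build_tools_list_request("<your-api-key>"))` should return the tool list, either as plain JSON or as an event stream, depending on how the server chooses to respond.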
Tools (19)
Generation
| Tool | What It Does | Providers |
|---|---|---|
| create_image | Text-to-image, character references | Alibaba (Wan), Grok, Luma, MiniMax, Runway, Vertex AI |
| create_video | Text-to-video, image-to-video, video extension | Alibaba (Wan), Grok, Luma, MiniMax, Runway, Vertex AI (Veo 3.1) |
| create_audio | TTS, sound effects, voice cloning, voice design | MiniMax, Runway, Vertex AI |
| create_music | Music from lyrics and style prompts | MiniMax |
| create_text | LLM chat completions, structured output | Grok, MiniMax, Vertex AI (Gemini) |
| create_artifact | Charts, presentations, documents, diagrams; bundle mode for multi-artifact packages | plotly, pptx, docx, weasyprint, mermaid, gamma |
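Each generation tool is invoked through the standard MCP `tools/call` method. The sketch below shows only the JSON-RPC envelope. The argument names `prompt` and `provider` are hypothetical placeholders, not confirmed parameters; the real input schema for each tool is whatever `tools/list` reports.

```python
import json


def build_tool_call(tool_name: str, arguments: dict, request_id: str = "2") -> dict:
    """Wrap a tool name and its arguments in an MCP tools/call JSON-RPC request."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": tool_name, "arguments": arguments},
    }


# Hypothetical arguments: check the tool's inputSchema from tools/list first.
payload = build_tool_call(
    "create_image",
    {"prompt": "a lighthouse at dusk, film photograph", "provider": "luma"},
)
print(json.dumps(payload, indent=2))
```

The same envelope should apply to the editing, analysis, adapter, and library tools; only `name` and `arguments` change.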
Composition
| Tool | What It Does |
|---|---|
| compose_video | Server-side pipeline: enrich plan, generate assets in parallel, assemble with transitions, audio ducking, and overlays |
Editing
| Tool | What It Does |
|---|---|
| edit_video | Core video transforms: trim, concat, crossfade, speed, loop, color grade, burn subtitles, custom FFmpeg |
| edit_audio | Mix, replace, or pad audio on existing media |
| compose_media | Add text, overlay media, or build split-screen composites |
| apply_effect | Ken Burns, speed ramp, film grain, vignette |
| extract_media | Extract frames, frame sets, or audio tracks |
Analysis
| Tool | What It Does | Providers |
|---|---|---|
| analyze_media | Technical metadata, vision QA, transcription, segment detection, EDL export | Anthropic, Grok, OpenAI, Qwen, Vertex (+ soundside.ai ffprobe) |
Adapters (LoRA)
| Tool | What It Does | Backends |
|---|---|---|
| train_adapter | Train a LoRA adapter from library media | DashScope (Wan), Modal (Hunyuan/LTX) |
| list_adapters | List your LoRA adapters | |
| manage_adapter | Inspect, deploy, undeploy, delete, or select checkpoint | |
Library Management
| Tool | What It Does |
|---|---|
| lib_list | Browse projects, collections, resources, lineage, brand kits; query credit balance |
| lib_manage | CRUD for projects, collections, resources, brand kits |
| lib_share | Share projects with other users by email |
Pricing
Soundside passes provider costs through with a small margin (~10%), aiming roughly to break even. The editing engine and library are priced at $0.01/call; vision QA is $0.03.
Live pricing is always available at:
GET https://mcp.soundside.ai/api/x402/status
This returns machine-readable per-tool, per-provider USDC prices. Prices are DB-driven and may change — always check the endpoint rather than hardcoding.
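Because prices can change, a client can fetch the status document at startup instead of hardcoding numbers. The sketch below returns the parsed JSON without interpreting it, since the response schema is not described here. The function name `fetch_pricing` and its injectable `opener` parameter are illustrative choices, not part of any Soundside SDK.

```python
import json
import urllib.request

STATUS_URL = "https://mcp.soundside.ai/api/x402/status"


def fetch_pricing(opener=urllib.request.urlopen, timeout: float = 15.0):
    """Fetch the live pricing document and return it as parsed JSON.

    `opener` is injectable so the function can be exercised without network access.
    """
    with opener(STATUS_URL, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Inspect the returned structure once before writing any lookup logic against it; the key names for tools, providers, and prices are whatever the endpoint actually returns.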
x402: Pay-Per-Call with Crypto
No API key needed. Pay with USDC on Base (L2) per tool call via EIP-3009 transferWithAuthorization (off-chain signing, facilitator pays gas).
Network: eip155:8453 (Base mainnet)
Token: USDC
Facilitator: Coinbase CDP
See x402 Guide for full setup.
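To make the off-chain signing step concrete, the sketch below assembles the EIP-712 typed data that an EIP-3009 `transferWithAuthorization` signature covers. Several values are assumptions to verify before use: the USDC contract address on Base, the EIP-712 domain `name` and `version`, and the 60-second validity window. In a real flow the recipient and amount come from the server's payment requirements, and the resulting structure is signed by a wallet library, which is not shown here. The helper names are hypothetical.

```python
import secrets
import time
from decimal import Decimal

USDC_DECIMALS = 6      # USDC uses 6 decimal places
BASE_CHAIN_ID = 8453   # eip155:8453, Base mainnet
# Assumption: USDC token contract on Base mainnet; verify against the payment requirements.
USDC_BASE_ADDRESS = "0x833589fCD6eDb6E08f4c7C32D4f71b54bdA02913"


def usdc_to_atomic(amount: str) -> int:
    """Convert a decimal USDC amount such as '0.01' into integer base units."""
    return int(Decimal(amount) * 10**USDC_DECIMALS)


def build_transfer_authorization(payer, pay_to, amount_usdc, valid_for_seconds=60, now=None):
    """Return EIP-712 typed data for an EIP-3009 TransferWithAuthorization message."""
    now = int(time.time()) if now is None else now
    return {
        "types": {
            "EIP712Domain": [
                {"name": "name", "type": "string"},
                {"name": "version", "type": "string"},
                {"name": "chainId", "type": "uint256"},
                {"name": "verifyingContract", "type": "address"},
            ],
            "TransferWithAuthorization": [
                {"name": "from", "type": "address"},
                {"name": "to", "type": "address"},
                {"name": "value", "type": "uint256"},
                {"name": "validAfter", "type": "uint256"},
                {"name": "validBefore", "type": "uint256"},
                {"name": "nonce", "type": "bytes32"},
            ],
        },
        "primaryType": "TransferWithAuthorization",
        # Assumption: domain name/version as commonly used for USDC on Base.
        "domain": {
            "name": "USD Coin",
            "version": "2",
            "chainId": BASE_CHAIN_ID,
            "verifyingContract": USDC_BASE_ADDRESS,
        },
        "message": {
            "from": payer,
            "to": pay_to,
            "value": usdc_to_atomic(amount_usdc),
            "validAfter": 0,
            "validBefore": now + valid_for_seconds,
            "nonce": "0x" + secrets.token_hex(32),  # random 32-byte nonce
        },
    }
```

Because the payer only signs this message and never submits a transaction, the facilitator can broadcast the transfer and cover gas, which matches the flow described above.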
Guides
- Getting Started — First MCP connection in 5 minutes
- x402 Pay-Per-Call — Crypto payments, no account needed
- Tool Reference — Detailed docs for all 19 tools
Examples
- Python — API Key — Connect and generate with httpx
- Python — x402 — Pay-per-call with USDC
- TypeScript — API Key — Node.js MCP client
- OpenClaw Skill — One-line config for OpenClaw agents
Links
- Website: soundside.ai
- MCP Endpoint: https://mcp.soundside.ai/mcp
- Live Pricing: https://mcp.soundside.ai/api/x402/status
- GitHub: github.com/soundside-design/soundside-docs