transloadit-media-processing
by github

Cloud-based media processing for video, audio, images, and documents using 86+ specialized robots.

  • Supports video encoding (HLS, MP4, WebM), thumbnail generation, image resizing/watermarking, audio transcoding, document OCR, and speech-to-text via chainable processing steps
  • Access via MCP server (recommended for IDE integration) or CLI; requires a free Transloadit account with API credentials
  • Build multi-step pipelines by chaining robot operations together using the "use" field; reuse...

npx skills add https://github.com/github/awesome-copilot --skill transloadit-media-processing

Transloadit Media Processing

Process, transform, and encode media files using Transloadit's cloud infrastructure. Supports video, audio, images, and documents with 86+ specialized processing robots.

When to Use This Skill

Use this skill when you need to:

  • Encode video to HLS, MP4, WebM, or other formats
  • Generate thumbnails or animated GIFs from video
  • Resize, crop, watermark, or optimize images
  • Convert between image formats (JPEG, PNG, WebP, AVIF, HEIF)
  • Extract or transcode audio (MP3, AAC, FLAC, WAV)
  • Concatenate video or audio clips
  • Add subtitles or overlay text on video
  • OCR documents (PDF, scanned images)
  • Run speech-to-text or text-to-speech
  • Apply AI-based content moderation or object detection
  • Build multi-step media pipelines that chain operations together

Setup

Option A: MCP Server (recommended for Copilot)

Add the Transloadit MCP server to your IDE config. This gives the agent direct access to Transloadit tools (create_template, create_assembly, list_assembly_notifications, etc.).

VS Code / GitHub Copilot (.vscode/mcp.json or user settings):

{
  "servers": {
    "transloadit": {
      "command": "npx",
      "args": ["-y", "@transloadit/mcp-server", "stdio"],
      "env": {
        "TRANSLOADIT_KEY": "YOUR_AUTH_KEY",
        "TRANSLOADIT_SECRET": "YOUR_AUTH_SECRET"
      }
    }
  }
}

Get your API credentials at https://transloadit.com/c/-/api-credentials

Option B: CLI

If you prefer running commands directly:

npx -y @transloadit/node assemblies create \
  --steps '{"encoded": {"robot": "/video/encode", "use": ":original", "preset": "hls-1080p"}}' \
  --wait \
  --input ./my-video.mp4
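
Hand-quoting JSON inside a shell command is easy to get wrong, so it can help to generate the argument list from a dictionary. The following is a minimal sketch, assuming the CLI accepts exactly the flags shown in the command above (--steps with inline JSON, --wait, --input); the helper name build_assembly_command is hypothetical and not part of any Transloadit package.

```python
import json
import shlex
from typing import Dict, List


def build_assembly_command(steps: Dict[str, dict], input_path: str) -> List[str]:
    """Build the argv list for the CLI call shown above.

    Hypothetical helper: it only reuses the flags that appear in this
    document and does not call or verify the CLI itself.
    """
    return [
        "npx", "-y", "@transloadit/node", "assemblies", "create",
        "--steps", json.dumps(steps),  # serialize steps instead of hand-quoting
        "--wait",
        "--input", input_path,
    ]


steps = {
    "encoded": {"robot": "/video/encode", "use": ":original", "preset": "hls-1080p"}
}
command = build_assembly_command(steps, "./my-video.mp4")

# shlex.join produces a shell-safe string for copy and paste.
print(shlex.join(command))
```

The list form can be passed to subprocess.run(command, check=True) unchanged, which avoids shell quoting altogether.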

Core Workflows

Encode Video to HLS (Adaptive Streaming)

{
  "steps": {
    "encoded": {
      "robot": "/video/encode",
      "use": ":original",
      "preset": "hls-1080p"
    }
  }
}

Generate Thumbnails from Video

{
  "steps": {
    "thumbnails": {
      "robot": "/video/thumbs",
      "use": ":original",
      "count": 8,
      "width": 320,
      "height": 240
    }
  }
}

Resize and Watermark Images

{
  "steps": {
    "resized": {
      "robot": "/image/resize",
      "use": ":original",
      "width": 1200,
      "height": 800,
      "resize_strategy": "fit"
    },
    "watermarked": {
      "robot": "/image/resize",
      "use": "resized",
      "watermark_url": "https://example.com/logo.png",
      "watermark_position": "bottom-right",
      "watermark_size": "15%"
    }
  }
}

OCR a Document

{
  "steps": {
    "recognized": {
      "robot": "/document/ocr",
      "use": ":original",
      "provider": "aws",
      "format": "text"
    }
  }
}

Concatenate Audio Clips

{
  "steps": {
    "imported": {
      "robot": "/http/import",
      "url": ["https://example.com/clip1.mp3", "https://example.com/clip2.mp3"]
    },
    "concatenated": {
      "robot": "/audio/concat",
      "use": "imported",
      "preset": "mp3"
    }
  }
}

Multi-Step Pipelines

Steps can be chained using the "use" field. Each step references a previous step's output:

{
  "steps": {
    "resized": {
      "robot": "/image/resize",
      "use": ":original",
      "width": 1920
    },
    "optimized": {
      "robot": "/image/optimize",
      "use": "resized"
    },
    "exported": {
      "robot": "/s3/store",
      "use": "optimized",
      "bucket": "my-bucket",
      "path": "processed/${file.name}"
    }
  }
}
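
Because every step names its source through "use", a pipeline can be checked locally before it is submitted. The sketch below is an illustration only, assuming each "use" value is a single string (either ":original" or another step name); the function resolve_order is hypothetical and does not reflect how Transloadit schedules steps internally.

```python
from typing import Dict, List

ORIGINAL = ":original"  # the document's name for the uploaded input file


def resolve_order(steps: Dict[str, dict]) -> List[str]:
    """Return step names ordered so each step comes after the step it uses.

    Raises ValueError when "use" names an undefined step, is not a string,
    or when the steps form a cycle.
    """
    ordered: List[str] = []
    state: Dict[str, str] = {}  # step name -> "visiting" or "done"

    def visit(name: str) -> None:
        if state.get(name) == "done":
            return
        if state.get(name) == "visiting":
            raise ValueError(f"cycle detected at step {name!r}")
        state[name] = "visiting"
        source = steps[name].get("use")
        if source is not None and source != ORIGINAL:
            if not isinstance(source, str):
                raise ValueError(f"step {name!r}: only string 'use' values are handled here")
            if source not in steps:
                raise ValueError(f"step {name!r} uses undefined step {source!r}")
            visit(source)  # make sure the source step is ordered first
        state[name] = "done"
        ordered.append(name)

    for name in steps:
        visit(name)
    return ordered


# Same steps as the pipeline above, deliberately listed out of order.
pipeline = {
    "exported": {"robot": "/s3/store", "use": "optimized", "bucket": "my-bucket"},
    "optimized": {"robot": "/image/optimize", "use": "resized"},
    "resized": {"robot": "/image/resize", "use": ":original", "width": 1920},
}
print(resolve_order(pipeline))  # ['resized', 'optimized', 'exported']
```

A misspelled step name in "use" surfaces here as a ValueError instead of a failed job.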

Key Concepts

  • Assembly: A single processing job. Created via create_assembly (MCP) or assemblies create (CLI).
  • Template: A reusable set of steps stored on Transloadit. Created via create_template (MCP) or templates create (CLI).
  • Robot: A processing unit (e.g., /video/encode, /image/resize). See full list at https://transloadit.com/docs/transcoding/
  • Steps: JSON object defining the pipeline. Each key is a step name, each value configures a robot.
  • :original: Refers to the uploaded input file.

Tips

  • Use --wait with the CLI to block until processing completes.
  • Use preset values (e.g., "hls-1080p", "mp3", "webp") for common format targets instead of specifying every parameter.
  • Chain "use": "step_name" to build multi-step pipelines without intermediate downloads.
  • For batch processing, pull files in with an import robot: /http/import for URLs, with dedicated import robots for S3, GCS, Azure, FTP, and Dropbox.
  • Templates can include ${variables} for dynamic values passed at assembly creation time.
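
For batch jobs, the steps object can be generated from a list of URLs rather than written by hand. This is a minimal sketch that reproduces the shape of the "Concatenate Audio Clips" workflow above; the function name build_concat_steps is hypothetical, and the robot parameters are taken from this document rather than verified against the API.

```python
import json
from typing import Dict, List


def build_concat_steps(urls: List[str], preset: str = "mp3") -> Dict[str, dict]:
    """Build an import-then-concatenate steps object for the given URLs.

    Mirrors the JSON in the "Concatenate Audio Clips" workflow; it only
    constructs the dictionary and does not contact Transloadit.
    """
    if not urls:
        raise ValueError("at least one URL is required")
    return {
        "steps": {
            "imported": {"robot": "/http/import", "url": list(urls)},
            "concatenated": {
                "robot": "/audio/concat",
                "use": "imported",  # chain onto the import step
                "preset": preset,
            },
        }
    }


clips = ["https://example.com/clip1.mp3", "https://example.com/clip2.mp3"]
print(json.dumps(build_concat_steps(clips), indent=2))
```

The printed JSON can be saved as a template or passed to an assembly in the same way as the hand-written examples.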
