R

Skills Runcomfy Com

ace-step
runcomfy-com
Generate, inpaint, and outpaint music with ACE Step on RunComfy via the `runcomfy` CLI. ACE Step is StepFun-AI's open-weights music foundation model — tag-driven composition (genre, mood, instruments), multilingual lyrics with section markers, 5 s to 4 min stereo output, $0.0002–0.0003 per second (≈ 27× cheaper than ElevenLabs Music). Four endpoints: ACE Step text-to-audio (the default), ACE Step 1.5 text-to-audio (50+ language lyrics, refined structured-lyric handling), ACE Step...
creativeaudioapi
ai-avatar-video
runcomfy-com
Create AI avatar, talking-head, and lip-sync videos on RunComfy via the `runcomfy` CLI. Routes across ByteDance OmniHuman (audio-driven full-body avatar), Wan-AI Wan 2-7 (audio-driven mouth sync via `audio_url` on a portrait), HappyHorse 1.0 (Arena #1 t2v / i2v with in-pass audio), and Seedance v2 Pro (multi-modal cinematic with reference audio + reference subject). Picks the right model for the user's actual intent — UGC voiceover, virtual presenter, dubbed product demo, lip-synced...
videocreativemedia
ai-image-generation
runcomfy-com
Generate and edit images on RunComfy via the `runcomfy` CLI — a smart router across the full image-model catalog: FLUX 2 (Klein 9B/4B, Pro, Dev, Flash, Turbo, Max), Google Nano Banana 2 / Pro, OpenAI GPT Image 2, ByteDance Seedream 5 / 4-5 / 4-0 and Dreamina 4-0, Alibaba Qwen Image and Z-Image Turbo, Wan 2-7. Covers both text-to-image (t2i) and image-to-image / edit (i2i) endpoints — the skill picks the right model for the user's actual intent (typography precision, photoreal portraits,...
creativemediaimage
ai-music
runcomfy-com
Generate AI music on RunComfy via the `runcomfy` CLI — a smart router across the music-model catalog. Routes to ElevenLabs AI Music Generation (premium 44.1 kHz stereo vocal tracks, 5 s–5 min, $0.0083/s) and ACE Step / ACE Step 1.5 (StepFun-AI open-weights, tag-driven composition, multilingual lyrics, $0.0002–0.0003/s, ~27× cheaper), plus ACE Step audio-inpaint (regenerate a time range inside an existing track) and ACE Step audio-outpaint (extend a track before or after). Picks the right...
creativeaudiomedia
ai-video-generation
runcomfy-com
Generate AI videos on RunComfy via the `runcomfy` CLI — a smart router across the full video-model catalog: HappyHorse 1.0 (Arena #1, native in-pass audio), Wan-AI Wan 2-7 (open weights, audio-driven lip-sync), ByteDance Seedance v2 / 1-5 / 1-0 (multi-modal cinematic), Kling 3.0 / 2-6, Google Veo 3-1, MiniMax Hailuo 2-3, ByteDance Dreamina 3-0. Covers text-to-video (t2v), image-to-video (i2v), and Veo's video-extend endpoint. The skill picks the right model for the user's intent (Arena-#1...
creativevideomedia
codex-pet
runcomfy-com
Codex Pet generator on RunComfy. Build a Codex-compatible Codex Pet spritesheet.webp + pet.json from a single reference image, drop it into `${CODEX_HOME:-$HOME/.codex}/pets/ /` and Codex picks it up as a custom Codex Pet next to the 8 built-ins. This skill produces the exact Codex Pet atlas Codex expects (1536x1872 PNG/WebP, 8 cols x 9 rows, 192x208 cells, 9 animation states — idle, running-right, running-left, waving, jumping, failed, waiting, running, review). Calls OpenAI GPT Image 2...
creativeimagedevelopment
controlnet-pose
runcomfy-com
Pose-conditioned generation on RunComfy via the `runcomfy` CLI. Routes across Kling 2-6 Motion Control Pro / Standard (transfer the motion / blocking of a reference video onto a target character), community Wan 2-2 Animate (audio-driven character animation with pose conditioning), and Z-Image Turbo ControlNet LoRA (pose-conditioned image generation from an OpenPose / DWPose / canny / depth control image). Picks the right route based on video vs still and stylized vs photoreal. Triggers on...
creativemediavideo
elevenlabs-music-generation
runcomfy-com
Generate full songs and instrumental tracks with ElevenLabs Music on RunComfy via the `runcomfy` CLI. ElevenLabs Music turns a style description plus structured lyrics into studio-quality 44.1 kHz stereo audio — 5 seconds to 5 minutes — with section-level control (Intro / Verse / Chorus / Bridge), multilingual vocals, and commercial-friendly output. Generate a backing track, a full vocal song, a jingle, a podcast intro, a game loop, or an instrumental bed. Calls `runcomfy run...
creativeaudiomedia
face-swap
runcomfy-com
Swap a face / character into video or images on RunComfy via the `runcomfy` CLI. Routes across community Wan 2-2 Animate (audio-driven character animation + identity swap), GPT Image 2 Edit (single-shot precise face swap on still images via reference composition), Nano Banana Edit (batch identity-preserving swap), Flux Kontext (single-ref high-fidelity local face edit), and Kling 2-6 Motion Control Pro (transfer motion from one performance onto a target character). Picks the right model for...
creativevideoimage
flux-2-klein
runcomfy-com
Tạo hình ảnh với Flux 2 Klein (biến thể nhanh được chưng cất của Flux 2 từ Black Forest Labs) trên RunComfy — tích hợp sẵn các mẫu prompt đã được ghi chép của mô hình để kỹ năng đạt đầu ra sắc nét hơn so với prompt thông thường trên cùng mô hình. Ghi lại các điểm mạnh của Flux 2 Klein (độ trễ dưới giây, tạo kiểu thương hiệu đa tham chiếu, prompt ưu tiên chủ ngữ khai báo), chiến lược số bước (4–8 để lặp nhanh, ~25 để hoàn thiện), sự đánh đổi giữa biến thể 9B và 4B, và thời điểm chuyển hướng sang Flux 2 Pro /...
creativeimageresearch
flux-kontext
runcomfy-com
Chỉnh sửa hình ảnh với Flux 1 Kontext Pro (mô hình chỉnh sửa cục bộ chính xác của Black Forest Labs) trên RunComfy — được tích hợp sẵn các mẫu prompt đã được ghi chép của mô hình, giúp kỹ năng đạt đầu ra sắc nét hơn so với việc dùng prompt thô trên cùng mô hình. Ghi lại điểm mạnh của Flux Kontext (chỉnh sửa cục bộ chính xác từ một tham chiếu, kiểm soát prompt mạnh mẽ, đầu ra chất lượng cao nhất quán), lược đồ (một hình ảnh + prompt), và thời điểm chuyển hướng sang Nano Banana Edit / GPT Image 2 edit / Flux 2 Klein. Gọi...
creativeimagedocument
gpt-image-2
runcomfy-com
Generate and edit images with OpenAI GPT Image 2 (ChatGPT Images 2.0) on RunComfy. Documents GPT Image 2's strengths (embedded text, logos, multilingual typography, instruction precision), its 3 fixed sizes, edit-with-preservation language, and when to route to a sibling (Flux 2 / Nano Banana Pro / Seedream) instead. Calls `runcomfy run openai/gpt-image-2/text-to-image` or `/edit` through the local RunComfy CLI. Triggers on "gpt image 2", "gpt-image-2", "ChatGPT Images 2", "image 2", or any...
creativeimagemedia
gpt-image-edit
runcomfy-com
Edit images with OpenAI GPT Image 2 (the `/edit` endpoint of ChatGPT Images 2.0) on RunComfy — bundled with the model's documented prompting patterns so the skill gets sharper output than naive prompting against the same model. Documents GPT Image Edit's strengths (preservation language, multilingual in-image text editing, multi-reference up to 10 images, layout / typography precision), the schema, and when to route to Nano Banana Edit / Flux Kontext / GPT Image 2 t2i instead. Calls...
imagecreativeapi
happyhorse-1-0
runcomfy-com
Generate text-to-video with HappyHorse 1.0 on RunComfy. Documents HappyHorse 1.0's strengths (#1 on Artificial Analysis Video Arena, native 1080p with in-pass synchronized audio, multi-shot character consistency, 6-language prompt support), the duration / aspect-ratio / resolution schema, and when to route to Wan 2.7 / Seedance 2 / LTX 2 instead. Calls `runcomfy run happyhorse/happyhorse-1-0/text-to-video` through the local RunComfy CLI. Triggers on "happyhorse", "happy horse", "happyhorse...
creativevideomedia
image-edit
runcomfy-com
Chỉnh sửa hình ảnh trên RunComfy — kỹ năng này là bộ định tuyến thông minh, khớp ý định của người dùng với mô hình chỉnh sửa phù hợp trong danh mục RunComfy. Chọn Nano Banana Edit (xử lý hàng loạt tối đa 20 ảnh, giữ nguyên nhận dạng mặc định), OpenAI GPT Image 2 Edit (viết lại văn bản trong ảnh đa ngôn ngữ, kết hợp nhiều tham chiếu, bố cục chính xác), Flux Kontext Pro (chỉnh sửa cục bộ độ trung thực cao với một tham chiếu), hoặc Z-Image Turbo Inpaint (chỉnh sửa vùng chính xác dựa trên mặt nạ). Tổng hợp các mẫu gợi ý đã
creativeimagemedia
image-inpainting
runcomfy-com
Mask-driven image inpainting on RunComfy via the `runcomfy` CLI. Routes to Tongyi MAI Z-Image Turbo Inpainting (the dedicated inpainting endpoint with mask, strength, and control-scale) and to identity-preserving edit models (Nano Banana 2 Edit, GPT Image 2 Edit, FLUX Kontext Pro) when a mask isn't available and the region must be described instead. Use for object removal, watermark removal, region replacement, blemish cleanup, and any controlled local edit where a binary mask defines the...
creativeimagemedia
image-outpainting
runcomfy-com
Image outpainting on RunComfy via the `runcomfy` CLI — extend a still beyond its original canvas, fill in what the camera didn't capture, change aspect ratio (square → 16:9, portrait → landscape) while preserving the original content. Routes across Nano Banana 2 Edit (default, spatial-language driven), GPT Image 2 Edit (multi-ref with reference-style matching), FLUX Kontext Pro (single-shot maximum-preservation), and the brand edit endpoints (Seedream / Dreamina / Qwen / FLUX 2). Picks the...
creativeimagemedia
image-to-video
runcomfy-com
Animate any still image on RunComfy — this skill is a smart router that matches the user's intent to the right i2v model in the RunComfy catalog. Picks HappyHorse 1.0 I2V (Arena #1, native audio, identity preservation) for general animations, Wan 2.7 with `audio_url` for custom-voiceover lip-sync, or Seedance 2.0 Pro for multi-modal animation from image + reference video + reference audio. Bundles each model's documented prompting patterns so the caller gets sharper output without burning...
creativevideomedia
kling-3-0
runcomfy-com
Tạo video Kling 3.0 trên RunComfy. Kling 3.0 (còn gọi là Kling V3.0) là mô hình video đa cảnh thế hệ thứ ba của Kuaishou Technology, có âm thanh đồng bộ gốc và nhận diện nhân vật nhất quán giữa các cảnh. Kỹ năng này bao gồm tất cả sáu điểm cuối của Kling 3.0, trải dài ba cấp độ kết xuất (Standard, Pro, 4K) và hai chế độ (văn bản thành video, hình ảnh thành video). Gọi runcomfy run kling/kling-3.0/ / thông qua CLI RunComfy cục bộ. Kích hoạt bằng "kling", "kling 3.0", "kling v3", "kling pro",...
videocreativemedia
lipsync
runcomfy-com
Lip-sync a face to a specific audio track on RunComfy via the `runcomfy` CLI. Routes across ByteDance OmniHuman (audio-driven full-body avatar from a portrait + audio), Sync Labs sync v2 / Pro (state-of-the-art mouth sync onto a video), Kling lipsync (audio-to- video and text-to-video with synced speech), and Creatify lipsync. The skill picks the right endpoint for the user's actual intent — portrait still + audio (avatar-style), source video + audio (mouth- swap on existing footage), or...
creativevideomedia
nano-banana-2
runcomfy-com
Tạo hình ảnh với Google Nano Banana 2 (dòng flash Gemini dùng để chuyển văn bản thành hình ảnh) trên RunComfy — được tích hợp sẵn các mẫu gợi ý đã được ghi chép của mô hình, giúp kỹ năng này cho ra kết quả sắc nét hơn so với việc gợi ý thông thường trên cùng một mô hình. Ghi lại các điểm mạnh của Nano Banana 2 (lặp nhanh, hiển thị chữ trong ảnh, khung hình dễ đoán, có thể dùng ngữ cảnh từ web), giá theo bậc độ phân giải, mức điều chỉnh độ an toàn, và thời điểm nên chuyển sang Nano Banana Pro / GPT Image 2 / Flux 2 / Seedream...
creativeimageresearch
nano-banana-edit
runcomfy-com
Edit images with Google Nano Banana 2 (image-to-image edit endpoint) on RunComfy. Documents Nano Banana Edit's strengths (preserve subject identity, swap background, localize edits with spatial language, multi-image batch edits up to 20 inputs), the schema, and when to route to GPT Image 2 edit / Flux Kontext / Nano Banana 2 t2i instead. Calls `runcomfy run google/nano-banana-2/edit` through the local RunComfy CLI. Triggers on "nano banana edit", "edit with nano banana", "image edit nano...
creativeimageapi
relight
runcomfy-com
Relight a still image — change the lighting setup, color temperature, direction, or mood — on RunComfy via the `runcomfy` CLI. Routes to Qwen Edit 2509's dedicated `relight` LoRA endpoint for purpose-built relighting, with fallback to identity-preserving edit endpoints (Nano Banana 2 Edit, GPT Image 2 Edit, FLUX Kontext Pro) when prose lighting language is enough. Use for product relighting (studio softbox → window light), portrait mood shift (overcast → golden hour), or color-grade change....
creativeimagemedia
runcomfy-cli
runcomfy-com
Run any model on RunComfy from the command line. The `runcomfy` CLI is one binary, one auth, hundreds of model endpoints — image generation, image edit, video generation, image-to-video, lip-sync, face swap, video edit, inpainting, outpainting, extend, ControlNet, relight, upscale, LoRA training and more. Submit a request, poll for status, download the output. This skill teaches the agent how to install, authenticate, discover model schemas, invoke models, stream / poll / no-wait, script in...
creativemediaapi
seedance-v2
runcomfy-com
Generate cinematic short-form video with ByteDance Seedance 2.0 Pro on RunComfy. Documents Seedance 2.0 Pro's strengths (multi-modal references — up to 9 images, 3 videos, 3 audio — synchronized in-pass audio with natural lip-sync, cinematic motion refinement), the 4–15s duration schema, and when to route to HappyHorse 1.0 / Wan 2.7 / Kling instead. Calls `runcomfy run bytedance/seedance-v2/pro` through the local RunComfy CLI. Triggers on "seedance", "seedance 2", "seedance v2", "seedance...
creativevideomedia
video-edit
runcomfy-com
Chỉnh sửa video có sẵn trên RunComfy — kỹ năng này là bộ định tuyến thông minh, khớp ý định của người dùng với mô hình chỉnh sửa phù hợp trong danh mục RunComfy. Lựa chọn Wan 2.7 Edit-Video (tạo lại phong cách tổng thể / thay đổi nền / thay đổi bao bì, giữ nguyên nhận dạng và chuyển động), Kling 2.6 Pro Motion Control (chuyển chuyển động chính xác từ video tham chiếu sang nhân vật mục tiêu), hoặc Lucy Edit Restyle (tạo lại phong cách nhẹ nhàng, giữ nguyên nhận dạng / thay đổi trang phục). Tổng hợp các mẫu gợi ý đã được ghi lại của từng mô hình để kỹ
videocreativemedia
video-extend
runcomfy-com
Extend or continue an existing video clip on RunComfy via the `runcomfy` CLI. Routes to Google Veo 3-1's `extend-video` and `fast/extend-video` endpoints — pick the source video plus a prompt describing what should happen next, and the model produces a clip that continues the original with consistent motion, lighting, and subject identity. Use when the user has a short Veo clip and wants it longer, or wants a chained narrative built shot-by-shot from a single seed clip. Triggers on "extend...
videocreativemedia
video-inpainting
runcomfy-com
Region edits across video frames on RunComfy via the `runcomfy` CLI — remove an object that appears across many frames, clean up wires or watermarks, replace a region with matching motion. Routes across Wan 2-7 edit-video (default, prompt-driven region edits with spatial language), Lucy Edit Restyle (identity-stable region-aware restyle), and Seedream 4-0 edit-sequential (when treating the clip as a frame stack). Picks the right route based on whether the change is prose-driven,...
videocreativemedia
video-outpainting
runcomfy-com
Video outpainting on RunComfy via the `runcomfy` CLI — extend the spatial canvas of a video, change aspect ratio (9:16 vertical to 16:9 horizontal or vice versa), add environment beyond the original frame while preserving the central action. Routes prompt-shaped spatial extension through Wan 2-7 edit-video and points the agent at dedicated ComfyUI outpaint workflows when seam quality matters for hero delivery. Triggers on "video outpaint", "video outpainting", "extend video canvas", "expand...
videocreativemedia
wan-2-7
runcomfy-com
Generate text-to-video with Wan 2.7 (Wan-AI's flagship motion model) on RunComfy. Documents Wan 2.7's strengths (multi-reference conditioning, audio-driven lip-sync via `audio_url`, smoother transitions, prompt expansion), the duration / resolution / aspect-ratio schema, and when to route to HappyHorse 1.0 / Seedance 2.0 / Kling / LTX 2 instead. Calls `runcomfy run wan-ai/wan-2-7/text-to-video` through the local RunComfy CLI. Triggers on "wan", "wan 2.7", "wan-2-7", "wan video", or any...
creativevideomedia