R

Skills de Runcomfy Com

Generate, inpaint, and outpaint music with ACE Step on RunComfy via the `runcomfy` CLI. ACE Step is StepFun-AI's open-weights music foundation model — tag-driven composition (genre, mood, instruments), multilingual lyrics with section markers, 5 s to 4 min stereo output, $0.0002–0.0003 per second (≈ 27× cheaper than ElevenLabs Music). Four endpoints: ACE Step text-to-audio (the default), ACE Step 1.5 text-to-audio (50+ language lyrics, refined structured-lyric handling), ACE Step...

creativeaudioapi

ai-avatar-video

Create AI avatar, talking-head, and lip-sync videos on RunComfy via the `runcomfy` CLI. Routes across ByteDance OmniHuman (audio-driven full-body avatar), Wan-AI Wan 2-7 (audio-driven mouth sync via `audio_url` on a portrait), HappyHorse 1.0 (Arena #1 t2v / i2v with in-pass audio), and Seedance v2 Pro (multi-modal cinematic with reference audio + reference subject). Picks the right model for the user's actual intent — UGC voiceover, virtual presenter, dubbed product demo, lip-synced...

videocreativemedia

ai-image-generation

We need to translate the given text from English to Spanish, preserving the name "ai-image-generation" only if it appears in the source text. The source text does not contain that name; it only appears in the instruction as the name to preserve, but not in the text to translate. So we just translate the text inside <text>. We must preserve product names, protocol names, URLs, numbers, technical terms. The text includes: RunComfy, runcomfy CLI, FLUX 2 (Klein 9B/4B, Pro, Dev, Flash, Turbo, Max), Google Nano Banana 2 / Pro, OpenAI GPT Image 2, ByteDance Seedream 5 / 4-5 / 4-0 and Dreamina 4-0, Alibaba Qwen Image and Z-Image Turbo, Wan 2-7, text-to-image (t2i), image-to-image / edit (i2i). Also "typography precision, photoreal portraits,..." Keep all these as is. Translate the rest naturally. The text ends

creativemediaimage

Generate AI music on RunComfy via the `runcomfy` CLI — a smart router across the music-model catalog. Routes to ElevenLabs AI Music Generation (premium 44.1 kHz stereo vocal tracks, 5 s–5 min, $0.0083/s) and ACE Step / ACE Step 1.5 (StepFun-AI open-weights, tag-driven composition, multilingual lyrics, $0.0002–0.0003/s, ~27× cheaper), plus ACE Step audio-inpaint (regenerate a time range inside an existing track) and ACE Step audio-outpaint (extend a track before or after). Picks the right...

creativeaudiomedia

ai-video-generation

Generate AI videos on RunComfy via the `runcomfy` CLI — a smart router across the full video-model catalog: HappyHorse 1.0 (Arena #1, native in-pass audio), Wan-AI Wan 2-7 (open weights, audio-driven lip-sync), ByteDance Seedance v2 / 1-5 / 1-0 (multi-modal cinematic), Kling 3.0 / 2-6, Google Veo 3-1, MiniMax Hailuo 2-3, ByteDance Dreamina 3-0. Covers text-to-video (t2v), image-to-video (i2v), and Veo's video-extend endpoint. The skill picks the right model for the user's intent (Arena-#1...

creativevideomedia

Codex Pet generator on RunComfy. Build a Codex-compatible Codex Pet spritesheet.webp + pet.json from a single reference image, drop it into `${CODEX_HOME:-$HOME/.codex}/pets/ /` and Codex picks it up as a custom Codex Pet next to the 8 built-ins. This skill produces the exact Codex Pet atlas Codex expects (1536x1872 PNG/WebP, 8 cols x 9 rows, 192x208 cells, 9 animation states — idle, running-right, running-left, waving, jumping, failed, waiting, running, review). Calls OpenAI GPT Image 2...

creativeimagedevelopment

controlnet-pose

Pose-conditioned generation on RunComfy via the `runcomfy` CLI. Routes across Kling 2-6 Motion Control Pro / Standard (transfer the motion / blocking of a reference video onto a target character), community Wan 2-2 Animate (audio-driven character animation with pose conditioning), and Z-Image Turbo ControlNet LoRA (pose-conditioned image generation from an OpenPose / DWPose / canny / depth control image). Picks the right route based on video vs still and stylized vs photoreal. Triggers on...

creativemediavideo

elevenlabs-music-generation

Generate full songs and instrumental tracks with ElevenLabs Music on RunComfy via the `runcomfy` CLI. ElevenLabs Music turns a style description plus structured lyrics into studio-quality 44.1 kHz stereo audio — 5 seconds to 5 minutes — with section-level control (Intro / Verse / Chorus / Bridge), multilingual vocals, and commercial-friendly output. Generate a backing track, a full vocal song, a jingle, a podcast intro, a game loop, or an instrumental bed. Calls `runcomfy run...

creativeaudiomedia

Swap a face / character into video or images on RunComfy via the `runcomfy` CLI. Routes across community Wan 2-2 Animate (audio-driven character animation + identity swap), GPT Image 2 Edit (single-shot precise face swap on still images via reference composition), Nano Banana Edit (batch identity-preserving swap), Flux Kontext (single-ref high-fidelity local face edit), and Kling 2-6 Motion Control Pro (transfer motion from one performance onto a target character). Picks the right model for...

creativevideoimage

Genera imágenes con Flux 2 Klein (variante rápida destilada de Flux 2 de Black Forest Labs) en RunComfy, incluyendo los patrones de prompting documentados del modelo para que la habilidad obtenga resultados más precisos que con prompting básico sobre el mismo modelo. Documenta las fortalezas de Flux 2 Klein (latencia de menos de un segundo, estilo de marca con múltiples referencias, prompts declarativos centrados en el sujeto), la estrategia de pasos (4–8 para iteración rápida, ~25 para refinamiento), el equilibrio entre las variantes de 9B y 4B, y cuándo redirigir a Flux 2 Pro /...

creativeimageresearch

Edita imágenes con Flux 1 Kontext Pro (el modelo de edición local precisa de imágenes de Black Forest Labs) en RunComfy, incluyendo los patrones de prompting documentados del modelo para que la habilidad obtenga resultados más precisos que con prompting básico sobre el mismo modelo. Documenta las fortalezas de Flux Kontext (ediciones locales precisas con una sola referencia, fuerte control de prompting, resultados consistentes de alta fidelidad), el esquema (una sola imagen + prompt), y cuándo redirigir a Nano Banana Edit / GPT Image 2 edit / Flux 2 Klein en su lugar. Llama...

creativeimagedocument

Generate and edit images with OpenAI GPT Image 2 (ChatGPT Images 2.0) on RunComfy. Documents GPT Image 2's strengths (embedded text, logos, multilingual typography, instruction precision), its 3 fixed sizes, edit-with-preservation language, and when to route to a sibling (Flux 2 / Nano Banana Pro / Seedream) instead. Calls `runcomfy run openai/gpt-image-2/text-to-image` or `/edit` through the local RunComfy CLI. Triggers on "gpt image 2", "gpt-image-2", "ChatGPT Images 2", "image 2", or any...

creativeimagemedia

Edit images with OpenAI GPT Image 2 (the `/edit` endpoint of ChatGPT Images 2.0) on RunComfy — bundled with the model's documented prompting patterns so the skill gets sharper output than naive prompting against the same model. Documents GPT Image Edit's strengths (preservation language, multilingual in-image text editing, multi-reference up to 10 images, layout / typography precision), the schema, and when to route to Nano Banana Edit / Flux Kontext / GPT Image 2 t2i instead. Calls...

imagecreativeapi

Generate text-to-video with HappyHorse 1.0 on RunComfy. Documents HappyHorse 1.0's strengths (#1 on Artificial Analysis Video Arena, native 1080p with in-pass synchronized audio, multi-shot character consistency, 6-language prompt support), the duration / aspect-ratio / resolution schema, and when to route to Wan 2.7 / Seedance 2 / LTX 2 instead. Calls `runcomfy run happyhorse/happyhorse-1-0/text-to-video` through the local RunComfy CLI. Triggers on "happyhorse", "happy horse", "happyhorse...

creativevideomedia

Edita imágenes en RunComfy: esta habilidad es un enrutador inteligente que empareja la intención del usuario con el modelo de edición adecuado en el catálogo de RunComfy. Selecciona Nano Banana Edit (lote de hasta 20, con preservación de identidad por defecto), OpenAI GPT Image 2 Edit (reescritura de texto en imagen multilingüe, composición multirreferencia, precisión de diseño), Flux Kontext Pro (edición local de alta fidelidad con una sola referencia) o Z-Image Turbo Inpaint (edición precisa de regiones guiada por máscara). Agrupa los patrones de indicaciones documentados de cada modelo para que la habilidad obtenga...

creativeimagemedia

image-inpainting

Mask-driven image inpainting on RunComfy via the `runcomfy` CLI. Routes to Tongyi MAI Z-Image Turbo Inpainting (the dedicated inpainting endpoint with mask, strength, and control-scale) and to identity-preserving edit models (Nano Banana 2 Edit, GPT Image 2 Edit, FLUX Kontext Pro) when a mask isn't available and the region must be described instead. Use for object removal, watermark removal, region replacement, blemish cleanup, and any controlled local edit where a binary mask defines the...

creativeimagemedia

image-outpainting

Image outpainting on RunComfy via the `runcomfy` CLI — extend a still beyond its original canvas, fill in what the camera didn't capture, change aspect ratio (square → 16:9, portrait → landscape) while preserving the original content. Routes across Nano Banana 2 Edit (default, spatial-language driven), GPT Image 2 Edit (multi-ref with reference-style matching), FLUX Kontext Pro (single-shot maximum-preservation), and the brand edit endpoints (Seedream / Dreamina / Qwen / FLUX 2). Picks the...

creativeimagemedia

We need to translate the given text from English to Spanish. The text describes an agent skill called "image-to-video" but the instruction says to preserve the name only if it appears in the source text. The name "image-to-video" does not appear in the provided text, so we don't include it. We must preserve product names, protocol names, URLs, numbers, and technical terms. So "RunComfy", "HappyHorse 1.0 I2V", "Arena #1", "Wan 2.7", "audio_url", "Seedance 2.0 Pro" should remain as is. Also "i2v" is a technical term. Translate the rest naturally. The text: "Animate any still image on RunComfy — this skill is a smart router that matches the user's intent to the right i2v model in the RunComfy catalog. Picks HappyHorse 1.0 I2V (Arena #1, native audio, identity preservation) for general animations, Wan 2.

creativevideomedia

Generación de video Kling 3.0 en RunComfy. Kling 3.0 (también llamado Kling V3.0) es el modelo de video multishot de tercera generación de Kuaishou Technology, con audio sincronizado nativo e identidad de personaje consistente entre tomas. Esta habilidad cubre los seis endpoints de Kling 3.0, abarcando tres niveles de renderizado (Standard, Pro, 4K) y dos modos (texto a video, imagen a video). Ejecuta runcomfy run kling/kling-3.0/ / a través de la CLI local de RunComfy. Se activa con "kling", "kling 3.0", "kling v3", "kling pro",...

videocreativemedia

We need to translate the given text from English to Spanish, preserving the name "lipsync" and other technical terms like "RunComfy", "runcomfy CLI", "ByteDance OmniHuman", "Sync Labs sync v2 / Pro", "Kling lipsync", "Creatify lipsync". Also preserve URLs, numbers, etc. The instruction says to translate only the text inside <text>, and not include the name unless it appears in the source text. The name "lipsync" appears in the source text as "Lip-sync" at the beginning, so we should translate that as "Sincronización de labios" or similar? But careful: the instruction says "Preserve product names, protocol names, URLs, numbers, and technical terms." "lipsync" is a technical term? It's a skill name. The instruction says "Name to preserve: lipsync". So we should preserve the word "lipsync" as is when it appears. In the source, it's "Lip-sync" (capitalized,

creativevideomedia

Genera imágenes con Google Nano Banana 2 (texto a imagen de nivel flash de la familia Gemini) en RunComfy, incluye los patrones de indicaciones documentados del modelo para que la habilidad obtenga resultados más precisos que con indicaciones simples usando el mismo modelo. Documenta las fortalezas de Nano Banana 2 (iteración rápida, renderizado de tipografía en imagen, encuadre predecible, contexto web opcional), los precios por nivel de resolución, el dial de tolerancia de seguridad y cuándo dirigirse a Nano Banana Pro / GPT Image 2 / Flux 2 / Seedream...

creativeimageresearch

nano-banana-edit

Edit images with Google Nano Banana 2 (image-to-image edit endpoint) on RunComfy. Documents Nano Banana Edit's strengths (preserve subject identity, swap background, localize edits with spatial language, multi-image batch edits up to 20 inputs), the schema, and when to route to GPT Image 2 edit / Flux Kontext / Nano Banana 2 t2i instead. Calls `runcomfy run google/nano-banana-2/edit` through the local RunComfy CLI. Triggers on "nano banana edit", "edit with nano banana", "image edit nano...

creativeimageapi

Relight a still image — change the lighting setup, color temperature, direction, or mood — on RunComfy via the `runcomfy` CLI. Routes to Qwen Edit 2509's dedicated `relight` LoRA endpoint for purpose-built relighting, with fallback to identity-preserving edit endpoints (Nano Banana 2 Edit, GPT Image 2 Edit, FLUX Kontext Pro) when prose lighting language is enough. Use for product relighting (studio softbox → window light), portrait mood shift (overcast → golden hour), or color-grade change....

creativeimagemedia

Run any model on RunComfy from the command line. The `runcomfy` CLI is one binary, one auth, hundreds of model endpoints — image generation, image edit, video generation, image-to-video, lip-sync, face swap, video edit, inpainting, outpainting, extend, ControlNet, relight, upscale, LoRA training and more. Submit a request, poll for status, download the output. This skill teaches the agent how to install, authenticate, discover model schemas, invoke models, stream / poll / no-wait, script in...

creativemediaapi

We need to translate the given text from English to Spanish. The text is a description of an agent skill for generating video using ByteDance Seedance 2.0 Pro on RunComfy. We must preserve the name "seedance-v2" and other technical terms like "ByteDance Seedance 2.0 Pro", "RunComfy", "HappyHorse 1.0", "Wan 2.7", "Kling", "CLI", etc. Also preserve numbers, URLs, and protocol names. Do not add any extra commentary or labels. The translation should be natural Spanish. The text inside <text> is: "Generate cinematic short-form video with ByteDance Seedance 2.0 Pro on RunComfy. Documents Seedance 2.0 Pro's strengths (multi-modal references — up to 9 images, 3 videos, 3 audio — synchronized in-pass audio with natural lip-sync, cinematic motion refinement), the 4–15s duration schema, and when to route to HappyHorse 1.0

creativevideomedia

Editar video existente en RunComfy — esta habilidad es un enrutador inteligente que empareja la intención del usuario con el modelo de edición adecuado en el catálogo de RunComfy. Selecciona Wan 2.7 Edit-Video (reestilización general / cambio de fondo / cambio de empaque, preservación de identidad y movimiento), Kling 2.6 Pro Motion Control (transferir movimiento preciso desde un video de referencia a un personaje objetivo), o Lucy Edit Restyle (reestilización ligera con preservación de identidad / cambio de atuendo). Agrupa los patrones de indicaciones documentados de cada modelo para que la habilidad...

videocreativemedia

Extend or continue an existing video clip on RunComfy via the `runcomfy` CLI. Routes to Google Veo 3-1's `extend-video` and `fast/extend-video` endpoints — pick the source video plus a prompt describing what should happen next, and the model produces a clip that continues the original with consistent motion, lighting, and subject identity. Use when the user has a short Veo clip and wants it longer, or wants a chained narrative built shot-by-shot from a single seed clip. Triggers on "extend...

videocreativemedia

video-inpainting

Region edits across video frames on RunComfy via the `runcomfy` CLI — remove an object that appears across many frames, clean up wires or watermarks, replace a region with matching motion. Routes across Wan 2-7 edit-video (default, prompt-driven region edits with spatial language), Lucy Edit Restyle (identity-stable region-aware restyle), and Seedream 4-0 edit-sequential (when treating the clip as a frame stack). Picks the right route based on whether the change is prose-driven,...

videocreativemedia

video-outpainting

Video outpainting on RunComfy via the `runcomfy` CLI — extend the spatial canvas of a video, change aspect ratio (9:16 vertical to 16:9 horizontal or vice versa), add environment beyond the original frame while preserving the central action. Routes prompt-shaped spatial extension through Wan 2-7 edit-video and points the agent at dedicated ComfyUI outpaint workflows when seam quality matters for hero delivery. Triggers on "video outpaint", "video outpainting", "extend video canvas", "expand...

videocreativemedia

Generate text-to-video with Wan 2.7 (Wan-AI's flagship motion model) on RunComfy. Documents Wan 2.7's strengths (multi-reference conditioning, audio-driven lip-sync via `audio_url`, smoother transitions, prompt expansion), the duration / resolution / aspect-ratio schema, and when to route to HappyHorse 1.0 / Seedance 2.0 / Kling / LTX 2 instead. Calls `runcomfy run wan-ai/wan-2-7/text-to-video` through the local RunComfy CLI. Triggers on "wan", "wan 2.7", "wan-2-7", "wan video", or any...

creativevideomedia