D
Skills Doany Ai
ace-step
doany-ai
Generate, inpaint, and outpaint music with ACE Step on RunComfy via the `runcomfy` CLI. ACE Step is StepFun-AI's open-weights music foundation model — tag-driven composition (genre, mood, instruments), multilingual lyrics with section markers, 5 s to 4 min stereo output, $0.0002–0.0003 per second (≈ 27× cheaper than ElevenLabs Music). Four endpoints: ACE Step text-to-audio (the default), ACE Step 1.5 text-to-audio (50+ language lyrics, refined structured-lyric handling), ACE Step...
creativemediaaudio
ai-avatar-video
doany-ai
Create AI avatar, talking-head, and lip-sync videos on RunComfy via the `runcomfy` CLI. Routes across ByteDance OmniHuman (audio-driven full-body avatar), Wan-AI Wan 2-7 (audio-driven mouth sync via `audio_url` on a portrait), HappyHorse 1.0 (Arena #1 t2v / i2v with in-pass audio), and Seedance v2 Pro (multi-modal cinematic with reference audio + reference subject). Picks the right model for the user's actual intent — UGC voiceover, virtual presenter, dubbed product demo, lip-synced...
videocreativemedia
ai-image-generation
doany-ai
Generate and edit images on RunComfy via the `runcomfy` CLI — a smart router across the full image-model catalog: FLUX 2 (Klein 9B/4B, Pro, Dev, Flash, Turbo, Max), Google Nano Banana 2 / Pro, OpenAI GPT Image 2, ByteDance Seedream 5 / 4-5 / 4-0 and Dreamina 4-0, Alibaba Qwen Image and Z-Image Turbo, Wan 2-7. Covers both text-to-image (t2i) and image-to-image / edit (i2i) endpoints — the skill picks the right model for the user's actual intent (typography precision, photoreal portraits,...
creativemediaimage
ai-music
doany-ai
Generate AI music on RunComfy via the `runcomfy` CLI — a smart router across the music-model catalog. Routes to ElevenLabs AI Music Generation (premium 44.1 kHz stereo vocal tracks, 5 s–5 min, $0.0083/s) and ACE Step / ACE Step 1.5 (StepFun-AI open-weights, tag-driven composition, multilingual lyrics, $0.0002–0.0003/s, ~27× cheaper), plus ACE Step audio-inpaint (regenerate a time range inside an existing track) and ACE Step audio-outpaint (extend a track before or after). Picks the right...
creativemediaaudio
ai-video-generation
doany-ai
Generate AI videos on RunComfy via the `runcomfy` CLI — a smart router across the full video-model catalog: HappyHorse 1.0 (Arena #1, native in-pass audio), Wan-AI Wan 2-7 (open weights, audio-driven lip-sync), ByteDance Seedance v2 / 1-5 / 1-0 (multi-modal cinematic), Kling 3.0 / 2-6, Google Veo 3-1, MiniMax Hailuo 2-3, ByteDance Dreamina 3-0. Covers text-to-video (t2v), image-to-video (i2v), and Veo's video-extend endpoint. The skill picks the right model for the user's intent (Arena-#1...
creativevideomedia
controlnet-pose
doany-ai
Pose-conditioned generation on RunComfy via the `runcomfy` CLI. Routes across Kling 2-6 Motion Control Pro / Standard (transfer the motion / blocking of a reference video onto a target character), community Wan 2-2 Animate (audio-driven character animation with pose conditioning), and Z-Image Turbo ControlNet LoRA (pose-conditioned image generation from an OpenPose / DWPose / canny / depth control image). Picks the right route based on video vs still and stylized vs photoreal. Triggers on...
creativemediavideo
elevenlabs-music-generation
doany-ai
Generate full songs and instrumental tracks with ElevenLabs Music on RunComfy via the `runcomfy` CLI. ElevenLabs Music turns a style description plus structured lyrics into studio-quality 44.1 kHz stereo audio — 5 seconds to 5 minutes — with section-level control (Intro / Verse / Chorus / Bridge), multilingual vocals, and commercial-friendly output. Generate a backing track, a full vocal song, a jingle, a podcast intro, a game loop, or an instrumental bed. Calls `runcomfy run...
creativeaudiomedia
face-swap
doany-ai
Swap a face / character into video or images on RunComfy via the `runcomfy` CLI. Routes across community Wan 2-2 Animate (audio-driven character animation + identity swap), GPT Image 2 Edit (single-shot precise face swap on still images via reference composition), Nano Banana Edit (batch identity-preserving swap), Flux Kontext (single-ref high-fidelity local face edit), and Kling 2-6 Motion Control Pro (transfer motion from one performance onto a target character). Picks the right model for...
creativevideoimage
flux-2-klein
doany-ai
Générer des images avec Flux 2 Klein (variante rapide distillée de Flux 2 par Black Forest Labs) sur RunComfy — inclut les schémas de prompt documentés du modèle pour que la compétence produise des résultats plus précis qu’un prompt naïf sur le même modèle. Documente les atouts de Flux 2 Klein (latence inférieure à la seconde, stylisme multi-référence de marque, prompts déclaratifs centrés sur le sujet), la stratégie de nombre d’étapes (4–8 pour itération rapide, ~25 pour finition), le compromis entre les variantes 9B et 4B, et le moment où rediriger vers Flux 2 Pro /...
creativeimagemedia
flux-kontext
doany-ai
Modifiez des images avec Flux 1 Kontext Pro (le modèle d'édition locale précise de Black Forest Labs) sur RunComfy — intégré aux schémas de prompt documentés du modèle pour que la compétence produise des résultats plus précis qu'une sollicitation naïve avec le même modèle. Documente les points forts de Flux Kontext (éditions locales précises à partir d'une seule référence, contrôle strict du prompt, sorties haute-fidélité cohérentes), le schéma (image unique + prompt), et quand rediriger vers Nano Banana Edit / GPT Image 2 edit / Flux 2 Klein à la place. Appelle...
creativeimagedocument
gpt-image-2
doany-ai
Generate and edit images with OpenAI GPT Image 2 (ChatGPT Images 2.0) on RunComfy. Documents GPT Image 2's strengths (embedded text, logos, multilingual typography, instruction precision), its 3 fixed sizes, edit-with-preservation language, and when to route to a sibling (Flux 2 / Nano Banana Pro / Seedream) instead. Calls `runcomfy run openai/gpt-image-2/text-to-image` or `/edit` through the local RunComfy CLI. Triggers on "gpt image 2", "gpt-image-2", "ChatGPT Images 2", "image 2", or any...
creativeimageapi
gpt-image-edit
doany-ai
Edit images with OpenAI GPT Image 2 (the `/edit` endpoint of ChatGPT Images 2.0) on RunComfy — bundled with the model's documented prompting patterns so the skill gets sharper output than naive prompting against the same model. Documents GPT Image Edit's strengths (preservation language, multilingual in-image text editing, multi-reference up to 10 images, layout / typography precision), the schema, and when to route to Nano Banana Edit / Flux Kontext / GPT Image 2 t2i instead. Calls...
imagecreativeapi
happyhorse-1-0
doany-ai
Generate text-to-video with HappyHorse 1.0 on RunComfy. Documents HappyHorse 1.0's strengths (#1 on Artificial Analysis Video Arena, native 1080p with in-pass synchronized audio, multi-shot character consistency, 6-language prompt support), the duration / aspect-ratio / resolution schema, and when to route to Wan 2.7 / Seedance 2 / LTX 2 instead. Calls `runcomfy run happyhorse/happyhorse-1-0/text-to-video` through the local RunComfy CLI. Triggers on "happyhorse", "happy horse", "happyhorse...
creativevideomedia
image-edit
doany-ai
Modifier des images sur RunComfy — cette compétence est un routeur intelligent qui associe l'intention de l'utilisateur au bon modèle d'édition dans le catalogue RunComfy. Sélectionne Nano Banana Edit (lot jusqu'à 20, préservation d'identité par défaut), OpenAI GPT Image 2 Edit (réécriture multilingue de texte dans l'image, composition multi-référence, précision de mise en page), Flux Kontext Pro (édition locale haute-fidélité à référence unique), ou Z-Image Turbo Inpaint (édition précise de région pilotée par masque). Regroupe les schémas de sollicitation documentés de chaque modèle afin que la compétence obtienne...
creativeimagemedia
image-inpainting
doany-ai
Mask-driven image inpainting on RunComfy via the `runcomfy` CLI. Routes to Tongyi MAI Z-Image Turbo Inpainting (the dedicated inpainting endpoint with mask, strength, and control-scale) and to identity-preserving edit models (Nano Banana 2 Edit, GPT Image 2 Edit, FLUX Kontext Pro) when a mask isn't available and the region must be described instead. Use for object removal, watermark removal, region replacement, blemish cleanup, and any controlled local edit where a binary mask defines the...
creativeimagemedia
image-outpainting
doany-ai
Image outpainting on RunComfy via the `runcomfy` CLI — extend a still beyond its original canvas, fill in what the camera didn't capture, change aspect ratio (square → 16:9, portrait → landscape) while preserving the original content. Routes across Nano Banana 2 Edit (default, spatial-language driven), GPT Image 2 Edit (multi-ref with reference-style matching), FLUX Kontext Pro (single-shot maximum-preservation), and the brand edit endpoints (Seedream / Dreamina / Qwen / FLUX 2). Picks the...
creativeimagemedia
image-to-video
doany-ai
Animate any still image on RunComfy — this skill is a smart router that matches the user's intent to the right i2v model in the RunComfy catalog. Picks HappyHorse 1.0 I2V (Arena #1, native audio, identity preservation) for general animations, Wan 2.7 with `audio_url` for custom-voiceover lip-sync, or Seedance 2.0 Pro for multi-modal animation from image + reference video + reference audio. Bundles each model's documented prompting patterns so the caller gets sharper output without burning...
creativevideomedia
kling-3-0
doany-ai
Génération vidéo Kling 3.0 sur RunComfy. Kling 3.0 (aussi appelé Kling V3.0) est le modèle vidéo multi-plan de troisième génération de Kuaishou Technology, avec audio natif synchronisé et identité de personnage cohérente entre les plans. Cette compétence couvre les six points de terminaison de Kling 3.0, répartis sur trois niveaux de rendu (Standard, Pro, 4K) et deux modes (texte-vers-vidéo, image-vers-vidéo). Les appels runcomfy run kling/kling-3.0/ / via l'interface CLI locale RunComfy. Déclencheurs sur "kling", "kling 3.0", "kling v3", "kling pro",...
videocreativemedia
lipsync
doany-ai
Lip-sync a face to a specific audio track on RunComfy via the `runcomfy` CLI. Routes across ByteDance OmniHuman (audio-driven full-body avatar from a portrait + audio), Sync Labs sync v2 / Pro (state-of-the-art mouth sync onto a video), Kling lipsync (audio-to- video and text-to-video with synced speech), and Creatify lipsync. The skill picks the right endpoint for the user's actual intent — portrait still + audio (avatar-style), source video + audio (mouth- swap on existing footage), or...
creativevideomedia
nano-banana-2
doany-ai
Générer des images avec Google Nano Banana 2 (text-to-image de niveau flash de la famille Gemini) sur RunComfy — intégré aux schémas de prompting documentés du modèle, afin que la compétence produise des résultats plus précis qu’un prompting naïf avec le même modèle. Documente les points forts de Nano Banana 2 (itération rapide, rendu typographique dans l’image, cadrage prévisible, contexte web facultatif), la tarification par palier de résolution, le curseur de tolérance de sécurité, et quand orienter vers Nano Banana Pro / GPT Image 2 / Flux 2 / Seedream...
creativeimageresearch
nano-banana-edit
doany-ai
Edit images with Google Nano Banana 2 (image-to-image edit endpoint) on RunComfy. Documents Nano Banana Edit's strengths (preserve subject identity, swap background, localize edits with spatial language, multi-image batch edits up to 20 inputs), the schema, and when to route to GPT Image 2 edit / Flux Kontext / Nano Banana 2 t2i instead. Calls `runcomfy run google/nano-banana-2/edit` through the local RunComfy CLI. Triggers on "nano banana edit", "edit with nano banana", "image edit nano...
creativeimageapi
relight
doany-ai
Relight a still image — change the lighting setup, color temperature, direction, or mood — on RunComfy via the `runcomfy` CLI. Routes to Qwen Edit 2509's dedicated `relight` LoRA endpoint for purpose-built relighting, with fallback to identity-preserving edit endpoints (Nano Banana 2 Edit, GPT Image 2 Edit, FLUX Kontext Pro) when prose lighting language is enough. Use for product relighting (studio softbox → window light), portrait mood shift (overcast → golden hour), or color-grade change....
creativeimagemedia
runcomfy-cli
doany-ai
Run any model on RunComfy from the command line. The `runcomfy` CLI is one binary, one auth, hundreds of model endpoints — image generation, image edit, video generation, image-to-video, lip-sync, face swap, video edit, inpainting, outpainting, extend, ControlNet, relight, upscale, LoRA training and more. Submit a request, poll for status, download the output. This skill teaches the agent how to install, authenticate, discover model schemas, invoke models, stream / poll / no-wait, script in...
creativemediaapi
seedance-v2
doany-ai
Generate cinematic short-form video with ByteDance Seedance 2.0 Pro on RunComfy. Documents Seedance 2.0 Pro's strengths (multi-modal references — up to 9 images, 3 videos, 3 audio — synchronized in-pass audio with natural lip-sync, cinematic motion refinement), the 4–15s duration schema, and when to route to HappyHorse 1.0 / Wan 2.7 / Kling instead. Calls `runcomfy run bytedance/seedance-v2/pro` through the local RunComfy CLI. Triggers on "seedance", "seedance 2", "seedance v2", "seedance...
videocreativemedia
video-edit
doany-ai
Modifier une vidéo existante sur RunComfy — cette compétence est un routeur intelligent qui associe l'intention de l'utilisateur au bon modèle d'édition dans le catalogue RunComfy. Sélectionne Wan 2.7 Edit-Video (restylage général / échange d'arrière-plan / échange d'emballage, préservation de l'identité et du mouvement), Kling 2.6 Pro Motion Control (transférer un mouvement précis d'une vidéo de référence à un personnage cible), ou Lucy Edit Restyle (restylage léger avec préservation de l'identité / échange de tenue). Regroupe les schémas de sollicitation documentés de chaque modèle afin que la compétence...
videocreativemedia
video-extend
doany-ai
Extend or continue an existing video clip on RunComfy via the `runcomfy` CLI. Routes to Google Veo 3-1's `extend-video` and `fast/extend-video` endpoints — pick the source video plus a prompt describing what should happen next, and the model produces a clip that continues the original with consistent motion, lighting, and subject identity. Use when the user has a short Veo clip and wants it longer, or wants a chained narrative built shot-by-shot from a single seed clip. Triggers on "extend...
videocreativemedia
video-inpainting
doany-ai
Region edits across video frames on RunComfy via the `runcomfy` CLI — remove an object that appears across many frames, clean up wires or watermarks, replace a region with matching motion. Routes across Wan 2-7 edit-video (default, prompt-driven region edits with spatial language), Lucy Edit Restyle (identity-stable region-aware restyle), and Seedream 4-0 edit-sequential (when treating the clip as a frame stack). Picks the right route based on whether the change is prose-driven,...
videocreativemedia
video-outpainting
doany-ai
Video outpainting on RunComfy via the `runcomfy` CLI — extend the spatial canvas of a video, change aspect ratio (9:16 vertical to 16:9 horizontal or vice versa), add environment beyond the original frame while preserving the central action. Routes prompt-shaped spatial extension through Wan 2-7 edit-video and points the agent at dedicated ComfyUI outpaint workflows when seam quality matters for hero delivery. Triggers on "video outpaint", "video outpainting", "extend video canvas", "expand...
videocreativemedia
wan-2-7
doany-ai
Generate text-to-video with Wan 2.7 (Wan-AI's flagship motion model) on RunComfy. Documents Wan 2.7's strengths (multi-reference conditioning, audio-driven lip-sync via `audio_url`, smoother transitions, prompt expansion), the duration / resolution / aspect-ratio schema, and when to route to HappyHorse 1.0 / Seedance 2.0 / Kling / LTX 2 instead. Calls `runcomfy run wan-ai/wan-2-7/text-to-video` through the local RunComfy CLI. Triggers on "wan", "wan 2.7", "wan-2-7", "wan video", or any...
creativevideomedia