website-to-video

Captura un sitio web/URL general y conviértelo en un video de HyperFrames (recorrido del sitio, presentación o clip social a partir de los propios elementos visuales del sitio). Utiliza capturas de pantalla de Chrome sin interfaz gráfica + activos de marca. Úsalo cuando la intención sea general: presentación de portafolio/blog/página de aterrizaje o clip social del sitio. NO para: lanzamiento o promoción de producto/SaaS (→ /product-launch-video, incluso desde una URL); explicador de tema sin sitio (→ /faceless-explainer); PR de GitHub (→ /pr-to-video); agregar subtítulos a un video existente (→...

npx skills add https://github.com/heygen-com/hyperframes --skill website-to-video

Website to HyperFrames

Capture a website, then produce a professional video from it.

Confirm the route before Step 0. This skill makes a video of / from a general site. If the user is really marketing / launching / promoting a product (even from this URL, even "promo for our site") → /product-launch-video. A topic explainer with no site/faceless-explainer; a GitHub PR/pr-to-video; re-cutting / recoloring / reordering an existing video file → out of scope. Routed here on a vague "make a video", or unsure launch-vs-general-site? Read /hyperframes first (full routing table + § What HyperFrames cannot do).

Users say things like:

  • "Turn this website into a 15-second social clip for Instagram"
  • "Make a 30-second site tour / showcase from https://..."
  • "Capture our homepage and build a video from its own visuals"

The workflow has 7 steps. Each produces an artifact that gates the next. By default it's collaborative — gates marked 💬 stop and ask the user. If the user signals autonomous mode ("decide for me", "surprise me"), 💬 user-preference gates are skipped; see step-2-brief.md for how that propagates.

Autonomous mode is NOT "skip all gates." Auto mode covers user-preference questions (TTS provider, voice, color emphasis, beat count, music yes/no, captions yes/no — where the agent decides on the user's behalf). It does NOT cover quality-verification gates. The following remain non-skippable in auto mode:

  • Asset Audit (Step 3) — viewing contact sheets and justifying USE/SKIP for each asset
  • Per-beat HTML read (Step 5) — structured evidence block per beat
  • DoD checklist (Step 6) — including animation-map, per-warning WCAG verification, audio/motion playback
  • Honest disclosure section (Step 6) — "What I did NOT verify" must appear in your final summary

If you find yourself reasoning "auto mode says bias toward action, so I'll skip X" — and X is a verification gate, not a preference question — that reasoning is wrong. Bias toward action applies to deciding what to build, not to deciding whether to verify.


Step 0: Capture & Understand the Brand

Read: references/step-0-capture.md

Capture the site, then read the extracted data to understand the brand and product — what it does, who it's for, what voice it speaks in, what mood it lives in. The captured assets are a brand toolkit for later, not the building blocks the video is made from.

Gate: Site summary printed — strategy-first (what the product does, who it's for, brand voice) before the asset / color / font inventory.


Step 1: Brand Identity

Read: references/step-1-design.md

Write DESIGN.md — a brand cheat sheet covering the visual identity: colors, typography, component styles, layout principles. Use design-styles.json for exact computed values.

Speed option: For fast-pacing videos (billboard-per-beat), DESIGN.md can be a 50-line summary of colors + fonts + do's/don'ts — not a 300-line document. The sub-agent prompt in Step 5 pastes brand values directly, so DESIGN.md depth only matters for complex compositions.

Gate: DESIGN.md exists (any length) with at minimum: color palette, font choices, and do's/don'ts.


Step 2: Strategy & Messaging

Read: references/step-2-brief.md, references/capabilities.md (scan the Table of Contents — deep-dive sections only as needed)

Align with the user on what the video must communicate before talking visuals or assets. Parse the user's prompt — they probably already gave you the video type and style. Ask only what's missing: the ONE thing this video must say, the narrative arc, and the audience.

Gate: Video type, duration, format, and — critically — the message and narrative arc are locked. Without those, Step 3 can't write a concept-first storyboard.


Step 3: Storyboard + Script 💬

Read: references/step-3-storyboard.md

Write the storyboard concept-first: message → narrative arc → beats that serve the arc → techniques per beat → brand accents pass at the end. Then write the narration script to match. Present both to the user with a beat-by-beat summary. Iterate until they approve.

Gate: STORYBOARD.md + SCRIPT.md exist AND the user has approved the plan.


Step 4: VO, Timing + Captions 💬

Read: references/step-4-vo.md

If Step 2 said no narration — ask about background music, then skip to Step 5. Otherwise: ask the user which TTS provider (HeyGen TTS, ElevenLabs, or Kokoro), generate audio, transcribe, map timestamps to beats. Then ask about captions.

Gate: Either (a) no narration was requested and storyboard has manual beat timings, or (b) narration.wav + transcript.json exist and beat timings updated with real durations.


Step 5: Build Compositions

Read: The hyperframes skill (load it — every rule matters) Read: references/step-5-build.md

Build index.html and compositions following the architecture and pacing chosen in the storyboard (Step 3). Sub-agents run hyperframes lint and hyperframes snapshot on each beat before reporting back.

Gate: Every compositions/beat-N.html has been read top-to-bottom by the main agent against DESIGN.md and STORYBOARD.md. The per-beat checklist lives in step-5-build.md.


Step 6: Validate & Deliver

Read: references/step-6-validate.md

Lint, validate, take snapshots scaled to video length (formula: max(beats × 3, ceil(duration_seconds / 2))), and review each one. Fix issues before delivering. Deliver the localhost Studio project URL — only render to MP4 on explicit user request. Surface that Studio URL only at handoff — it is the final, stable preview; the build-phase snapshots are headless, so do not pop a preview mid-build.

Deliver something you're proud of. Before handing off, ask yourself: would I post this on social media with my name on it? If not, fix what's wrong.

Gate: npx hyperframes lint and npx hyperframes validate pass with zero errors, and the final response includes the active Studio project URL.


Quick Reference

Video Types

Typical constraints by video type — use as a starting point, not a formula. Beat count should follow from the content and the narration, not from a target range.

TypeTypical durationDuration driverNarration
Social ad (IG/TikTok)10–15sPlatform limitOptional
Product demo30–60sScript lengthFull narration
Feature announcement15–30sFeature complexityFull narration
Brand reel20–45sMusic trackOptional, music focus
Launch teaser10–20sHook energyMinimal

Beat count is not in this table intentionally — it should come from the storyboard, not from "social ad = 3-4 beats." A social ad for a complex product might need 5 well-timed beats. A brand reel with one strong visual thesis might need 3.

Format

  • Landscape: 1920x1080 (default)
  • Portrait: 1080x1920 (Instagram Stories, TikTok)
  • Square: 1080x1080 (Instagram feed)

Reference Files

FileWhen to read
step-0-capture.mdStep 0 — capture, understand the brand and product, write strategy-first site summary
step-1-design.mdStep 1 — write DESIGN.md brand cheat sheet (5 sections, 250-350 lines; 50-line fast-path for billboard-style social ads)
step-2-brief.mdStep 2 — align on message, narrative arc, audience with user
capabilities.mdSteps 2 & 5 — full inventory of what HyperFrames can do (24 sections). Scan the TOC during the brief, deep-dive specific sections during build
step-3-storyboard.mdStep 3 — storyboard + script (combined) with user review gate
step-4-vo.mdStep 4 — TTS provider choice, generation, timing
step-5-build.mdStep 5 — build index.html + compositions
step-6-validate.mdStep 6 — lint, validate, snapshots (scaled to video length), preview
techniques.mdSteps 3 & 5 — 13 primitive animation techniques with code patterns (adapt, don't copy-paste)
html-in-canvas-patterns.mdStep 5 — complete code patterns for HTML-in-Canvas effects (lives in the hyperframes skill)

Más skills de heygen-com

hyperframes-cli
heygen-com
HyperFrames CLI dev loop — `npx hyperframes` for scaffolding (init), validation (lint, inspect), preview, render, and environment troubleshooting (doctor, browser, info, upgrade). Use when running any of these commands or troubleshooting the HyperFrames build/render environment. For asset preprocessing commands (`tts`, `transcribe`, `remove-background`), invoke the `hyperframes-media` skill instead.
developmenttestingapi
hyperframes-animation
heygen-com
Todo el conocimiento de animación para HyperFrames: reglas de movimiento atómico, planos de escenas multifásicas, transiciones de escenas, técnicas más amplias de diseño de movimiento Y los siete adaptadores de tiempo de ejecución (GSAP predeterminado, más Lottie, Three.js, Anime.js, keyframes CSS, Web Animations API, TypeGPU). Úselo para cualquier tarea de movimiento o animación: elija de 2 a 4 reglas y componga, o cargue un plano, o consulte la API específica del tiempo de ejecución (por ejemplo, suavizados de GSAP / reproductor de Lottie / mezclador de Three.js). Nativo de HyperFrames: línea de tiempo única en pausa, seguro de búsqueda,...
creativedevelopmentdesign
hyperframes-core
heygen-com
HyperFrames HTML composition contract. Use for composition structure, data attributes, clips, tracks, sub-compositions, variables, media playback, deterministic render rules, and validation of minimal renderable projects.
developmentmediacreative
hyperframes-media
heygen-com
Preprocesamiento de activos para composiciones HyperFrames — TTS multiproveedor (HeyGen / ElevenLabs / Kokoro local), BGM multiproveedor (Google Lyria / MusicGen local), transcripción Whisper, eliminación de fondo y creación de subtítulos. Úselo para npx hyperframes tts, bgm, transcribe, remove-background, selección de voz/proveedor, indicación de estado de ánimo musical, subtítulos / letras / karaoke / estilo por palabra.
mediaaudiovideo
hyperframes-registry
heygen-com
Instalar y conectar bloques y componentes del registro en composiciones de HyperFrames. Usar al ejecutar hyperframes add, al instalar un bloque o componente, al conectar un elemento instalado en index.html, o al trabajar con hyperframes.json. Cubre el comando add, ubicaciones de instalación, conexión de subcomposiciones de bloques, fusión de fragmentos de componentes, descubrimiento del registro y creación de un nuevo bloque o componente para contribuir upstream (idea → andamiaje → validación → PR).
developmentapicode-review
general-video
heygen-com
Úsalo como respaldo para la creación de composiciones de video HTML con HyperFrames personalizadas cuando ningún flujo de trabajo especializado se ajuste. Cubre piezas largas o de múltiples escenas, reels de marca o sizzle, montajes, tarjetas de título, pósters en movimiento de larga duración, bucles estáticos y composiciones libres de cualquier duración o formato. No es para promociones de productos comercializados (product-launch-video), captura general de sitio web a video (website-to-video), explicadores de temas (faceless-explainer), videos de PR de GitHub (pr-to-video), subtitulado de material existente...
videocreativemedia
motion-graphics
heygen-com
Úsalo cuando el usuario quiera un motion graphic breve y centrado en el diseño, donde el movimiento sea el mensaje: tipografía cinética, conteo de estadísticas o números, impacto de gráfico/dato, sting de logotipo, bloque de marca, tercio inferior, llamada, superposición social, titular animado/tweet/noticia, póster en movimiento o resalte rápido de página capturada. Generalmente de menos de 10 segundos y hasta ~30 segundos, sin arco narrativo, locución ni sujeto de acción real. Puede renderizarse en MP4 o superposición transparente. No es para piezas más largas, de múltiples escenas, narradas o de reel de marca...
creativevideodesign
hyperframes-read-first
heygen-com
Comienza AQUÍ para cualquier solicitud de hacer, crear, generar, editar, animar o renderizar un video, animación, gráfico en movimiento, video explicativo, tarjeta de título, superposición, video con subtítulos, promoción de producto, video para sitio web, video de relaciones públicas o registro de cambios, montaje de datos, póster animado o composición HTML de HyperFrames. Úsalo antes que otras habilidades de video o animación cuando el usuario quiera que HyperFrames cree o renderice un video MP4/web terminado, elija un flujo de trabajo, o decida entre video de lanzamiento de producto, video explicativo sin rostro, video de sitio web a video,...
creativevideomedia