firecrawl-knowledge-base

Construa uma base de conhecimento a partir de conteúdo web com Firecrawl. Use para documentação de referência local, chunks prontos para RAG, conjuntos de dados para fine-tuning, espelhos de documentação, corpora de tópicos ou markdown organizado de fontes web pronto para LLM.

npx skills add https://github.com/firecrawl/firecrawl-workflows --skill firecrawl-knowledge-base

Firecrawl Knowledge Base

Use this to turn URLs or topics into organized LLM-ready content.

Onboarding Interview

Infer the source, goal, depth, and output location from context. If the source and goal are clear, proceed immediately.

Ask at most 1-3 concise questions only if blocked, such as the source URL/topic, whether the output is reference/RAG/training/docs, or training format if training is requested.

Firecrawl Collection Plan

Use Firecrawl map for documentation sites, search for topic-based corpora, scrape pages into markdown, and preserve code examples and tables.

For files, follow the Firecrawl download-style convention:

.firecrawl/
  <hostname>/
    <path>/
      index.md

Parallel Work

If appropriate, use sub-agents or equivalent parallel task runners:

  • one docs section per researcher
  • official docs, tutorials, community discussions, and references by source type
  • source scraping vs chunk generation vs manifest generation

Output Modes

  • Reference: markdown files, index.md, and sources.json.
  • RAG: markdown files plus chunk files and manifest.json.
  • Training: scraped source files plus training-data.jsonl and training-metadata.json.
  • Docs mirror: complete markdown mirror with a table of contents.

Final Deliverable

# Knowledge Base: [Source]

## Summary
[What was collected and why]

## Output Structure
[Files/directories created]

## Coverage
[Sections, source types, counts]

## Usage Notes
[How to use in RAG, docs, training, or agent context]

## Sources
[URLs collected]

## Rerun Inputs
workflow: firecrawl-knowledge-base
source: [url/topic]
goal: [reference/rag/train/docs]
depth: [quick/thorough/exhaustive]
output_dir: [.firecrawl/]

Quality Bar

  • Preserve code examples and formatting.
  • Remove boilerplate navigation where possible.
  • Include source URLs in frontmatter or metadata.

Mais skills de firecrawl

oracle
firecrawl
Melhores práticas para usar a CLI do oracle (prompt + agrupamento de arquivos, engines, sessões e padrões de anexo de arquivos).
official
firecrawl-monitor
firecrawl
Detecte quando o conteúdo de um site muda e receba notificações por webhook ou e-mail — sem necessidade de cron jobs, scrapers ou scripts de diff. Use esta habilidade sempre que o usuário quiser rastrear alterações em uma página, monitorar preços de concorrentes, ser alertado sobre novas vagas de emprego ou postagens em blogs, acompanhar páginas de documentação/changelog/status, ou disser "monitore", "acompanhe", "rastreie", "me avise quando", "notifique quando X mudar", "me avise se", "me envie um e-mail quando" ou "envie um webhook quando". Um juiz de IA integrado filtra formatação, carimbo de data/hora e...
officialweb-scrapingresearch
firecrawl-deep-research
firecrawl
Run multi-source deep research with Firecrawl. Use when the user asks to research a topic, compare perspectives, produce a sourced briefing, investigate a technical or market question, or synthesize web evidence across many sources.
officialresearchweb-scraping
firecrawl-research-papers
firecrawl
Encontre e sintetize artigos de pesquisa, whitepapers, PDFs, relatórios técnicos e fontes acadêmicas com o Firecrawl. Use quando o usuário desejar uma revisão de literatura, resumo de artigo, panorama de pesquisa ou síntese com fontes de PDFs e publicações acadêmicas/industriais.
officialresearchweb-scraping
firecrawl-market-research
firecrawl
Extract market, financial, earnings, industry, and company metrics with Firecrawl. Use when the user asks for market research, industry trends, public company data, financial comparisons, earnings research, or structured market reports.
officialresearchweb-scraping
firecrawl-website-design-clone
firecrawl
Extract any website's design system into an agent-ready DESIGN.md using Firecrawl scrape evidence. Use when the user wants colors, fonts, spacing, components, layout patterns, or brand/UI guidance from a website so AI agents can create new websites, clone a look, or build pages inspired by that design.
officialdesignweb-scraping
firecrawl-lead-research
firecrawl
Produce pre-meeting lead intelligence briefs with Firecrawl. Use when the user needs company research, person research, recent news, talking points, pain points, or outreach preparation before a sales call, partnership meeting, investor conversation, or customer interview.
officialresearchweb-scraping
firecrawl-dashboard-reporting
firecrawl
Extraia métricas de painéis analíticos e ferramentas web internas com o navegador Firecrawl. Use quando o usuário precisar de relatórios de painéis, resumos de métricas entre plataformas, extração autenticada de análises, relatórios por período ou métricas estruturadas de painéis web.
officialbrowser-automationdata-analysis