firecrawl-download

作者： firecrawl

实验性功能。便捷命令，结合地图抓取与页面抓取，将整个网站保存为本地文件。

npx skills add https://github.com/firecrawl/firecrawl-cli --skill firecrawl-download

下载 ZIP GitHub

firecrawl download

Experimental. Convenience command that combines map + scrape to save an entire site as local files.

Maps the site first to discover pages, then scrapes each one into nested directories under .firecrawl/. All scrape options work with download. Always pass -y to skip the confirmation prompt.

When to use

You want to save an entire site (or section) to local files
You need offline access to documentation or content
Bulk content extraction with organized file structure

Quick start

# Interactive wizard (picks format, screenshots, paths for you)
firecrawl download https://docs.example.com

# With screenshots
firecrawl download https://docs.example.com --screenshot --limit 20 -y

# Multiple formats (each saved as its own file per page)
firecrawl download https://docs.example.com --format markdown,links --screenshot --limit 20 -y
# Creates per page: index.md + links.txt + screenshot.png

# Filter to specific sections
firecrawl download https://docs.example.com --include-paths "/features,/sdks"

# Skip translations
firecrawl download https://docs.example.com --exclude-paths "/zh,/ja,/fr,/es,/pt-BR"

# Full combo
firecrawl download https://docs.example.com \
  --include-paths "/features,/sdks" \
  --exclude-paths "/zh,/ja" \
  --only-main-content \
  --screenshot \
  -y

Download options

Option	Description
`--limit <n>`	Max pages to download
`--search <query>`	Filter URLs by search query
`--include-paths <paths>`	Only download matching paths
`--exclude-paths <paths>`	Skip matching paths
`--allow-subdomains`	Include subdomain pages
`-y`	Skip confirmation prompt (always use in automated flows)

Scrape options (all work with download)

-f <formats>, -H, -S, --screenshot, --full-page-screenshot, --only-main-content, --include-tags, --exclude-tags, --wait-for, --max-age, --country, --languages

See also

firecrawl-map — just discover URLs without downloading
firecrawl-scrape — scrape individual pages
firecrawl-crawl — bulk extract as JSON (not local files)

来自 firecrawl 的更多技能

使用oracle CLI的最佳实践（提示词与文件打包、引擎、会话及文件附件模式）。

firecrawl-monitor

检测网站内容变化，并通过webhook或邮件接收通知——无需cron任务、爬虫或差异脚本。当用户想要追踪页面变化、监控竞争对手定价、在新职位或博客发布时接收提醒、监测文档/更新日志/状态页面，或说出“监控”、“关注”、“追踪”、“当……时提醒我”、“当X变化时通知我”、“如果……请通知我”、“当……时发邮件给我”或“当……时发送webhook”时，使用此技能。内置AI判断器会过滤掉格式、时间戳和……

officialweb-scrapingresearch

firecrawl-deep-research

使用 Firecrawl 进行多源深度研究。当用户要求研究某个主题、比较不同观点、生成带来源的简报、调查技术或市场问题，或综合多个来源的网络证据时使用。

officialresearchweb-scraping

firecrawl-research-papers

使用Firecrawl查找并综合研究论文、白皮书、PDF文件、技术报告及学术来源。适用于用户需要文献综述、论文摘要、研究现状分析，或从PDF及学术/行业出版物中获取有来源的综合内容时。

officialresearchweb-scraping

firecrawl-market-research

使用Firecrawl提取市场、财务、收益、行业和公司指标。当用户询问市场研究、行业趋势、上市公司数据、财务比较、收益研究或结构化市场报告时使用。

officialresearchweb-scraping

firecrawl-website-design-clone

使用 Firecrawl 抓取证据，将任意网站的设计系统提取为可供智能体使用的 DESIGN.md 文件。当用户需要从网站获取颜色、字体、间距、组件、布局模式或品牌/UI 指导，以便 AI 智能体创建新网站、克隆外观或受该设计启发构建页面时使用。

officialdesignweb-scraping

firecrawl-knowledge-base

使用Firecrawl从网页内容构建知识库。适用于本地参考文档、RAG就绪文本块、微调数据集、文档镜像、主题语料库，或从网页来源整理的LLM就绪Markdown。

officialweb-scrapingresearch

firecrawl-lead-research

使用Firecrawl生成会前潜在客户情报简报。适用于用户在销售通话、合作会议、投资者对话或客户访谈前需要公司调研、人物调研、最新动态、谈话要点、痛点分析或外联准备时。

officialresearchweb-scraping