firecrawl-agent

Author: firecrawl

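AI-powered autonomous extraction of structured data from complex multi-page websites. The agent navigates sites intelligently to locate and extract data, and returns JSON results with optional schema validation. It supports custom JSON schemas for predictable structured output and falls back to freeform extraction when no schema is provided. Two model tiers are available (spark-1-mini and spark-1-pro), along with a credit limit and the option to wait for inline results. Best suited to multi-page extraction tasks; for simpler scraping, use...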

npx skills add https://github.com/firecrawl/cli --skill firecrawl-agent


firecrawl agent

AI-powered autonomous extraction. The agent navigates sites and extracts structured data (takes 2-5 minutes).

When to use

You need structured data from complex multi-page sites
Manual scraping would require navigating many pages
You want the AI to figure out where the data lives

Quick start

# Extract structured data
firecrawl agent "extract all pricing tiers" --wait -o .firecrawl/pricing.json

# With a JSON schema for structured output
firecrawl agent "extract products" --schema '{"type":"object","properties":{"name":{"type":"string"},"price":{"type":"number"}}}' --wait -o .firecrawl/products.json

# Focus on specific pages
firecrawl agent "get feature list" --urls "<url>" --wait -o .firecrawl/features.json

Options

Option	Description
`--urls <urls>`	Starting URLs for the agent
`--model <model>`	Model to use: spark-1-mini or spark-1-pro
`--schema <json>`	JSON schema for structured output
`--schema-file <path>`	Path to JSON schema file
`--max-credits <n>`	Credit limit for this agent run
`--wait`	Wait for agent to complete
`--pretty`	Pretty print JSON output
`-o, --output <path>`	Output file path
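
For longer schemas, passing a file with --schema-file is easier to maintain than an inline --schema string. The sketch below combines it with --model and --max-credits. The schema contents, the file name .firecrawl/products.schema.json, and the credit cap of 50 are illustrative assumptions, not values from the CLI documentation.

```bash
# Write an illustrative JSON schema to a file (field names are assumptions).
mkdir -p .firecrawl
cat > .firecrawl/products.schema.json <<'EOF'
{
  "type": "object",
  "properties": {
    "products": {
      "type": "array",
      "items": {
        "type": "object",
        "properties": {
          "name": { "type": "string" },
          "price": { "type": "number" }
        },
        "required": ["name"]
      }
    }
  }
}
EOF

# Run the agent only if the CLI is installed; every flag used here
# appears in the Options table above. The cap of 50 is arbitrary.
if command -v firecrawl >/dev/null 2>&1; then
  firecrawl agent "extract products" \
    --schema-file .firecrawl/products.schema.json \
    --model spark-1-mini \
    --max-credits 50 \
    --wait \
    -o .firecrawl/products.json
fi
```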

Tips

Always use --wait to get results inline. Without it, the command returns a job ID instead of the extracted data.
Use --schema for predictable, structured output — otherwise the agent returns freeform data.
Agent runs consume more credits than simple scrapes. Use --max-credits to cap spending.
For simple single-page extraction, prefer scrape — it's faster and cheaper.
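
One way to apply the first and third tips consistently is a small shell wrapper that always passes --wait and a credit cap. This is a minimal sketch: the function name agent_run, the FIRECRAWL_MAX_CREDITS variable, and the default cap of 100 are hypothetical and not part of the firecrawl CLI.

```bash
# Hypothetical helper: always wait for results and cap credits.
# Usage: agent_run <prompt> <output-path> [extra firecrawl agent options...]
agent_run() {
  local prompt="$1" out="$2"
  shift 2
  # FIRECRAWL_MAX_CREDITS is an illustrative variable name, not a CLI setting.
  firecrawl agent "$prompt" --wait \
    --max-credits "${FIRECRAWL_MAX_CREDITS:-100}" \
    -o "$out" "$@"
}
```

For example, agent_run "extract all pricing tiers" .firecrawl/pricing.json --model spark-1-pro forwards the extra --model option after the enforced flags.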

See also

firecrawl-scrape — simpler single-page extraction
firecrawl-interact — scrape + interact for manual page interaction (more control)
firecrawl-crawl — bulk extraction without AI

More skills from firecrawl

Best practices for using the oracle CLI (prompts and file bundling, engines, sessions, and file attachment patterns).

firecrawl-monitor

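Detect changes in website content and get notified by webhook or email, with no cron jobs, crawlers, or diff scripts required. Use this skill when the user wants to track page changes, monitor competitor pricing, get alerts when new jobs or blog posts are published, watch docs, changelogs, or status pages, or says "monitor", "watch", "track", "alert me when...", "notify me when X changes", "let me know if...", "email me when...", or "send a webhook when...". A built-in AI judge filters out formatting, timestamps, and...

official, web-scraping, research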

firecrawl-deep-research

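Multi-source deep research with Firecrawl. Use when the user asks to research a topic, compare viewpoints, produce a sourced brief, investigate a technical or market question, or synthesize web evidence from multiple sources.

official, research, web-scraping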

firecrawl-research-papers

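Find and synthesize research papers, white papers, PDFs, technical reports, and academic sources with Firecrawl. Use when the user needs a literature review, paper summaries, a state-of-research analysis, or a sourced synthesis from PDFs and academic or industry publications.

official, research, web-scraping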

firecrawl-market-research

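Extract market, financial, earnings, industry, and company metrics with Firecrawl. Use when the user asks about market research, industry trends, public company data, financial comparisons, earnings research, or structured market reports.

official, research, web-scraping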

firecrawl-website-design-clone

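Scrape evidence with Firecrawl to extract any website's design system into an agent-ready DESIGN.md file. Use when the user needs colors, fonts, spacing, components, layout patterns, or brand/UI guidance from a website so that an AI agent can create a new site, clone the look, or build pages inspired by that design.

official, design, web-scraping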

firecrawl-knowledge-base

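Build knowledge bases from web content with Firecrawl. Suited to local reference docs, RAG-ready text chunks, fine-tuning datasets, documentation mirrors, topical corpora, or LLM-ready Markdown curated from web sources.

official, web-scraping, research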

firecrawl-lead-research

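Generate pre-meeting lead intelligence briefs with Firecrawl. Use when the user needs company research, people research, recent news, talking points, pain-point analysis, or outreach prep before a sales call, partnership meeting, investor conversation, or customer interview.

official, research, web-scraping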