firecrawl-parse

Author: firecrawl

npx skills add https://github.com/firecrawl/cli --skill firecrawl-parse

firecrawl parse

Turn a local document into clean markdown on disk. Supports PDF, DOCX, DOC, ODT, RTF, XLSX, XLS, HTML/HTM/XHTML.

When to use

You have a file on disk (not a URL) and want its text as markdown
The user drops in a PDF/DOCX and asks what it says or wants it summarized
Use scrape instead when the source is a URL

Quick start

Always save to .firecrawl/ with -o. Parsed docs can run to hundreds of KB and will blow up the context if streamed to stdout. Add .firecrawl/ to .gitignore.
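One possible way to add the ignore entry without duplicating it on repeated runs is sketched below. It assumes you are in the repository root and that an existing .gitignore ends with a newline; it uses only standard grep and printf.

```shell
# Append ".firecrawl/" to .gitignore only if that exact line is not already present.
# -q: quiet, -x: match the whole line, -F: treat the pattern as a fixed string.
# 2>/dev/null hides the "no such file" message when .gitignore does not exist yet.
grep -qxF '.firecrawl/' .gitignore 2>/dev/null || printf '%s\n' '.firecrawl/' >> .gitignore
```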

mkdir -p .firecrawl

# File → markdown
firecrawl parse ./paper.pdf -o .firecrawl/paper.md

# AI summary
firecrawl parse ./paper.pdf -S -o .firecrawl/paper-summary.md

# Ask a question about the doc
firecrawl parse ./paper.pdf -Q "What are the main conclusions?" \
  -o .firecrawl/paper-qa.md

Then inspect the output with head, grep, rg, etc., or read the file incrementally. Don't load the whole thing at once.
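As a rough illustration of reading incrementally, the sketch below defines `peek_md`, a hypothetical helper that is not part of the firecrawl CLI. It reports the file size, prints the first 40 lines, and optionally lists numbered matches for a search term, using only standard tools.

```shell
# peek_md FILE [PATTERN]: hypothetical helper, not part of the firecrawl CLI.
peek_md() {
  file=$1
  pattern=${2:-}

  # Size in bytes and line count, to judge how much is safe to read at once.
  printf 'bytes: %s\n' "$(wc -c < "$file" | tr -d ' ')"
  printf 'lines: %s\n' "$(wc -l < "$file" | tr -d ' ')"

  # First 40 lines: usually the title and the opening sections.
  head -n 40 "$file"

  # Case-insensitive search with line numbers, only when a pattern is given.
  if [ -n "$pattern" ]; then
    grep -n -i -- "$pattern" "$file" || echo "no matches for: $pattern"
  fi
}
```

For example, `peek_md .firecrawl/paper.md conclusion` shows where the term appears, and `sed -n '120,160p' .firecrawl/paper.md` then reads just that range.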

Options

Option	Description
`-S, --summary`	AI-generated summary
`-Q, --query <prompt>`	Ask a question about the parsed content
`-o, --output <path>`	Output file path — always use this
`-f, --format <fmt>`	`markdown` (default), `html`, `summary`
`--timeout <ms>`	Timeout for the parse job
`--timing`	Show request duration
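To show how the options in the table combine, here is a minimal sketch built around `parse_as`, a hypothetical wrapper that is not part of the firecrawl CLI. The output naming scheme and the 120000 ms timeout are illustrative assumptions; only the flags listed above are passed to `firecrawl parse`.

```shell
# parse_as FORMAT FILE: hypothetical wrapper, not part of the firecrawl CLI.
# Writes the result to .firecrawl/<source name without extension>.<ext>.
parse_as() {
  format=$1
  src=$2

  # Pick a file extension for each documented -f value (naming is an assumption).
  case $format in
    markdown) ext=md ;;
    html)     ext=html ;;
    summary)  ext=summary.md ;;
    *) echo "unsupported format: $format" >&2; return 2 ;;
  esac

  name=$(basename "$src")
  out=".firecrawl/${name%.*}.$ext"
  mkdir -p .firecrawl

  # 120000 ms is an arbitrary example timeout; --timing shows the request duration.
  firecrawl parse "$src" -f "$format" --timeout 120000 --timing -o "$out"
}
```

Calling `parse_as html "./My Doc.pdf"` would write to `.firecrawl/My Doc.html`; the quoting inside the function keeps paths with spaces intact.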

Tips

Quote paths with spaces: firecrawl parse "./My Doc.pdf" -o .firecrawl/mydoc.md.
Max upload size: 50 MB per file.
Credits: roughly 1 per PDF page; an HTML file costs a flat 1 credit.
Check .firecrawl/ before re-parsing the same file.
To check your credit balance (recommended for batch processing and similar workflows), use the firecrawl credit-usage command.
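The size limit and the "check before re-parsing" tip can be combined in a small guard. The sketch below defines `parse_cached`, a hypothetical helper that is not part of the firecrawl CLI. It assumes "50 MB" means 50 × 1024 × 1024 bytes and relies on the `-nt` file test, which bash, dash, and zsh support but strict POSIX does not require.

```shell
# parse_cached FILE: hypothetical helper, not part of the firecrawl CLI.
parse_cached() {
  src=$1
  name=$(basename "$src")
  out=".firecrawl/${name%.*}.md"
  max_bytes=$((50 * 1024 * 1024))   # assumes "50 MB" means 50 MiB

  if [ ! -f "$src" ]; then
    echo "not a file: $src" >&2
    return 1
  fi

  # Refuse files over the upload limit before making a request.
  size=$(wc -c < "$src" | tr -d ' ')
  if [ "$size" -gt "$max_bytes" ]; then
    echo "too large ($size bytes): $src" >&2
    return 1
  fi

  # Reuse an existing non-empty result unless the source is newer than it.
  if [ -s "$out" ] && [ ! "$src" -nt "$out" ]; then
    echo "cached: $out"
    return 0
  fi

  mkdir -p .firecrawl
  firecrawl parse "$src" -o "$out" && echo "parsed: $out"
}
```

In a batch, something like `for f in ./docs/*.pdf; do parse_cached "$f"; done` skips files that were already parsed, which avoids spending credits twice; running `firecrawl credit-usage` beforehand shows the current balance.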

See also

firecrawl-scrape — same idea for URLs
