firecrawl-company-directories

Extract structured company lists from directories with Firecrawl. Use for scraping YC, Crunchbase, Product Hunt, G2, startup directories, category directories, or custom company databases into JSON, CSV, CRM-ready lists, or research tables.

npx skills add https://github.com/firecrawl/firecrawl-workflows --skill firecrawl-company-directories

Firecrawl Company Directories

Use this to turn startup or company directories into structured lists.

Onboarding Interview

Infer the directory, filters, result count, and output format from context. If the source is clear, proceed immediately.

Ask at most 1-3 concise questions only if blocked, such as the directory URL/name, required filters, or target result count.

Firecrawl Collection Plan

Use Firecrawl browser when the directory needs filters, pagination, infinite scroll, or profile clicks. Use scrape/map when listings are public and static.

Suggested sources include YC companies, Crunchbase, Product Hunt, G2 categories, or any custom directory URL.

Extraction Fields

Capture fields that are visible:

  • name
  • description
  • industry/category
  • stage/founded/location/team size/funding when visible
  • tags
  • directory profile URL
  • company website URL

Leave unavailable fields blank. Do not infer.

Final Deliverable

# Company Directory Export: [Source]

## Summary
[Filters, count extracted, limitations]

## Companies
[Table or link to JSON/CSV]

## Sources
[Directory pages and profiles used]

## Rerun Inputs
workflow: firecrawl-company-directories
directory: [source]
filters: [criteria]
max_results: [number]
output: [json/csv/markdown]

JSON Shape

Use source, filters, extractedAt, totalResults, and companies[] with name, url, description, industry, stage, founded, location, teamSize, funding, tags, profileUrl, and websiteUrl.

Quality Bar

  • Deduplicate companies.
  • Track pagination progress.
  • Note rate limits, login walls, or CAPTCHA blocks.

More skills from firecrawl

oracle
firecrawl
Best practices for using the oracle CLI (prompt + file bundling, engines, sessions, and file attachment patterns).
official
firecrawl-monitor
firecrawl
Detect when content on a website changes and get notified by webhook or email — no cron jobs, scrapers, or diff scripts required. Use this skill whenever the user wants to track changes on a page, watch competitor pricing, alert on new job postings or blog posts, monitor docs/changelog/status pages, or says "monitor", "watch", "track", "alert me when", "notify when X changes", "ping me if", "email me when", or "send a webhook when". A built-in AI judge filters out formatting, timestamp, and...
officialweb-scrapingresearch
firecrawl-deep-research
firecrawl
Run multi-source deep research with Firecrawl. Use when the user asks to research a topic, compare perspectives, produce a sourced briefing, investigate a technical or market question, or synthesize web evidence across many sources.
officialresearchweb-scraping
firecrawl-research-papers
firecrawl
Find and synthesize research papers, whitepapers, PDFs, technical reports, and academic sources with Firecrawl. Use when the user wants a literature review, paper summary, research landscape, or sourced synthesis from PDFs and scholarly/industry publications.
officialresearchweb-scraping
firecrawl-market-research
firecrawl
Extract market, financial, earnings, industry, and company metrics with Firecrawl. Use when the user asks for market research, industry trends, public company data, financial comparisons, earnings research, or structured market reports.
officialresearchweb-scraping
firecrawl-website-design-clone
firecrawl
Extract any website's design system into an agent-ready DESIGN.md using Firecrawl scrape evidence. Use when the user wants colors, fonts, spacing, components, layout patterns, or brand/UI guidance from a website so AI agents can create new websites, clone a look, or build pages inspired by that design.
officialdesignweb-scraping
firecrawl-knowledge-base
firecrawl
Build a knowledge base from web content with Firecrawl. Use for local reference docs, RAG-ready chunks, fine-tuning datasets, documentation mirrors, topic corpora, or LLM-ready markdown organized from web sources.
officialweb-scrapingresearch
firecrawl-lead-research
firecrawl
Produce pre-meeting lead intelligence briefs with Firecrawl. Use when the user needs company research, person research, recent news, talking points, pain points, or outreach preparation before a sales call, partnership meeting, investor conversation, or customer interview.
officialresearchweb-scraping