HomeLaunchesFirecrawl
12

/extract by Firecrawl - Get structured website data with just a prompt

Turn entire websites into structured data with AI

Hey everyone! We’re Eric, Caleb, and Nick from Firecrawl (S22). Today, we’re launching /extract — an endpoint that turns entire websites into structured data with a prompt.

TL;DR

With Firecrawls' new /extract endpoint, any website can be turned into structured data with a simple API call and prompt. We handle the complexity so you can focus on building your company.

The Problem.

If you need to pull data from websites - maybe to enrich your CRM, track competitors, or onboard users - you're stuck with:

  1. Manually researching and copy-pasting from multiple sources \
  2. Building and maintaining scrapers that break at the slightest site change
  3. Stitching together scraping services and complex LLM pipelines with limited context windows

    Each approach wastes the engineering time you could spend shipping a product. :

Our Solution:

/extract is an API that turns a prompt into structured web data.

Here's how to use it:

  1. Give us URLs + Prompt
    Write what data you want, and point us at websites. Use wildcards like example.com/* to scan entire sites.
  2. We Find Relevant Content
    Our crawler finds and ranks the pages that matter, automatically.
  3. AI Extracts Data
    Intelligent agents split, search and parallelize the work, handling sites of any size.
  4. Get Clean JSON
    Receive structured data ready to use - no post-processing needed.
  5. Integrate anywhere via API
    With our API, you can use firecrawl anywhere, whether its in your applications or no-code tools like Zapier

Why It Works

  • Handle Any Website: Built on proven scraping infrastructure that just works
  • Natural Language Input: Describe what you want in plain English - we figure out the schema
  • No Size Limits: Process massive sites by automatically splitting the work
  • Use It Anywhere: Full API + ready-made integrations for Python, Node, and Zapier

Limitations - (and the road ahead)

Let's be honest - while /extract is pretty awesome at grabbing web data, it's not perfect yet. Here's what we're still working on:

  1. Big sites are tricky - It can't (yet!) grab every single product on Amazon in one go
  2. Complex searches need work - Things like "find all posts posted after 2024" aren't quite there
  3. Sometimes, it's a bit quirky - Results can vary between runs, though it usually gets what you need

But here's the exciting part: we're seeing the future of web scraping take shape.

Get Started

  1. Try the Open Beta
  2. Join Our Community

Ready to turn web data into your competitive advantage? Get started in less than 5 minutes.

Get your API key at www.firecrawl.dev/app

— Eric, Caleb, and Nick at Firecrawl 🔥