Launch Week I / Day 5: Real-Time Crawling with WebSockets

Introducing Browser Sandbox - Give your agents a secure, fully managed browser environment Read more →

Get started

Ready to build?

Start getting Web Data for free and scale seamlessly as your project expands. No credit card needed.

Eric Ciarla

Aug 30, 2024

Launch Week I / Day 5: Real-Time Crawling with WebSockets image

Welcome to Day 5 of Firecrawl's Launch Week! We're excited to introduce an exciting new feature that will bring your web scraping projects to the next level: Real-Time Crawling with WebSockets.

Introducing Crawl URL and Watch

We're thrilled to announce our new WebSocket-based method, Crawl URL and Watch. This powerful feature enables real-time data extraction and monitoring, opening up new possibilities for immediate data processing.

How It Works

The Crawl URL and Watch method initiates a crawl job and returns a watcher object. You can then add event listeners for various events like "document" (when a new page is crawled), "error" (if an error occurs), and "done" (when the crawl is complete).

This approach allows you to process data in real-time, react to errors immediately, and know exactly when your crawl is finished.

We're excited to see how you'll use this new feature to enhance your web scraping projects and create more dynamic, responsive applications.

Check out our documentation for a detailed guide on how to implement Crawl URL and Watch in your projects: Firecrawl WebSocket Documentation

Eric Ciarla @ericciarla

Cofounder and CMO of Firecrawl

About the Author

Eric Ciarla is a co-founder of Firecrawl. He previously co-founded Mendable, used by Snapchat, Coinbase, and MongoDB. He's been building products in the AI and data space since 2022.

More articles by Eric Ciarla

Browser Sandbox: Secure Environments for Agents to Interact with the Web Branding Format v2: Improved Logo Extraction Extract Web Data at Scale With Parallel Agents Introducing the Firecrawl Skill and CLI - Give Agents Real-Time Web Data How Credal Extracts 6M+ URLs Monthly to Power Production AI Agents Introducing Spark 1 Pro and Spark 1 Mini Introducing /agent: Gather Data Wherever It Lives on the Web Retell’s AI phone agents get LLM-ready content from Firecrawl Introducing Firecrawl v2.5 - The World's Best Web Data API Why Firecrawl Beats Octoparse for AI Web Scraping

FOOTER

The easiest way to extract
data from the web

                                                                                                                                                 
                                                                                                                                                 
                                                                                                                                                 
                                                                                                                                                 
                                                                                                                                                 
                                                                .     .                                                                          
                                                               ..     ..+                                                                        
                                                                      .:.                                                                        
                                                               ..     ..         .::                                                             
                                                               +..   ..:          :.                                                             
                                                             .:..::.  ..          ..                                                             
                                                             .--:::.  ..     ...  .:.           ..                                               
                                            ..               .:+=-::.:.     . ...-.::.         ..                                                
                                            ::....           .:--+::..: ......:+....:.     :.. ..                                                
                                            .......            ::-=::::     ..:-:-...:     .--..::          .........                            
                            ..  .             . .              ..::-:-..      .-+-:::..    ...::::.        .: ...::.:..                          
                       .  -... ....:           .   .            .--=+-::.      :-=-:....  .  .:..::      .:---:::::-::....                       
                       ..::........::=.....    ...:-..        .:-=--+=-:.       ..--:..=::.... . .:..  ..:---::::---=:::..:...                   
              ..........::::.:::::::-::.-..  ...::--==:.      ..-::-+==-:...      .-::.......   ..--:. ..:=+==.---=-+-:::::::-..                 
          . .....::......:: ::::-::.---=+-:..::-+==++X=-:.   ..:-::-=-== ---..   .:.--::..       .:-==::=--X==-----====--::+:::+...              
          ..-....-:..::-::=-=-:-::--===++=-==-----== X+=-:.::-==----+==+XX+=-::.:+--==--::.      .:-+X=----+X=-=------===--::-:...:. ....        
          ....::::...:-:-==+++=++==+++XX++==++--+-+==++++=-===+=---:-==+X:XXX+=-:-=-==++=-:.     .:-=+=- -=X+X+===+---==--==--:..::...+....+     
         ..:::---.::.---=+==XXXXXXXX+XX++==++===--+===:+X+====+=--::--=+XXXXXXX+==++==+XX+=: ::::--=+++X++X+XXXX+=----==++.+=--::+::::+. ::.=... 
         .:::-==-------=X+++XXXXXXXXXXX++==++.==-==-:-==+X++==+=-=--=++++X++:X:X+++X+-+X X+=---=-==+=+++XXXXX+XX=+=--=X++XXX==---::-+-::::.:..-..

Backed by

Y Combinator

Linkedin Github YouTube

SOC II · Type 2

AICPA

SOC 2

X (Twitter)

Discord

Products

Playground Agent Pricing Templates Changelog

Use Cases

AI Platforms Lead Enrichment SEO Teams Deep Research Competitive Intelligence

Documentation

Getting started API Reference Integrations Examples SDKs

Company

Blog Careers Firestarters Ambassadors Affiliates Compare Firecrawl Student program