Introducing Search

June 18, 2025

Eric Ciarla imageEric Ciarla

Announcing Firestarter, our open source tool that turns any website into a chatbot

Announcing Firestarter, our open source tool that turns any website into a chatbot image

Every developer, product manager, and support team has felt this pressure. Your documentation is great, your website is packed with information, but users still ask the same questions over and over. 

You know the solution: a smart, AI-powered chatbot that can answer questions based on your content.

But building a production-grade retrieval-augmented generation (RAG) pipeline is a massive undertaking. It involves scraping, cleaning, chunking, embedding, storing, retrieving, and generating. Third-party services can help, but they often come with limitations (like locking you into their ecosystem).

So, we asked ourselves: What if you could create a production-ready RAG pipeline and a ready-to-use chatbot for any website in under 60 seconds?

This is the story of Firestarter.

Enter a URL. Get a chatbot.

Firestarter is a complete Next.js chatbot-building platform that we’ve made as simple as possible. It’s a new addition to our open-source AI toolbox, designed to make chatbot creation accessible to everyone:

  1. Enter any website URL (e.g., https://docs.firecrawl.dev)
  2. Click “Start” and watch as it crawls and indexes the content in real-time
  3. Instantly get a full-featured chat interface and a developer-ready, OpenAI-compatible API

Behind this simple UI is a sophisticated, pre-built RAG pipeline that handles the entire workflow from raw website to intelligent, queryable data source, powered by Firecrawl for web scraping and Upstash for serverless vector search.

The system is architected to let you build and deploy multiple chatbots based on as many websites as you’d like—each one completely independent with its own data namespace and API endpoint.

How it works: From website to wisdom

Firestarter isn’t just a simple script; it’s a complete, two-phase system for creating and interacting with knowledge bases.

Phase 1: The indexing engine

When you enter a URL, Firestarter kicks off a meticulous process:

  1. Smart Crawling: It uses Firecrawl to navigate the site, fetching the clean, structured Markdown content of each page. This bypasses the need to deal with raw HTML, ads, and other noise.
    \
  2. Intelligent Indexing: The content is streamed directly to an Upstash search index. Upstash automatically chunks the text, creates vector embeddings, and stores them in a high-performance, serverless vector database.
    \
  3. Namespace Creation: The entire crawl is isolated under a unique namespace (e.g., firecrawl-dev-1718394041), ensuring each chatbot’s data is separate and secure.
    \

Phase 2: The RAG-Powered Brain

Once indexed, your chatbot is live. When a user asks a question:

  1. Semantic Search: Firestarter queries the Upstash index, searching for the most relevant document chunks based on the meaning of the question, not just keywords.
    \
  2. Context-Aware Prompting: The most relevant chunks are compiled into a context block. This, along with the original question, is sent to your choice of LLM—OpenAI, Anthropic, Groq, or any LLM you like. Built with the Vercel AI SDK, there’s full flexibility to customize and use any LLM provider you decide to use.
    \
  3. Streaming Response: The LLM generates the answer, which is streamed back to the UI in real-time using the Vercel AI SDK, providing a smooth, human-like chat experience.
    \

The best part? Every step is transparent. 

And for developers, Firestarter creates something truly special: an OpenAI-compatible API endpoint for every single chatbot. 

This means you can use the official OpenAI libraries in any language to programmatically query your website’s content, treating it like a structured, intelligent database.

This is just the beginning

Production-grade RAG is complex. Our goal with Firestarter isn’t to be a closed, perfect-out-of-the-box solution.

Instead, we want to build a powerful, open source foundation that anyone can use, understand, and contribute to.

The version you see in this repository is our first major step. It works, it provides immense value, but we know there’s room to grow. By open-sourcing it, we’re inviting you to join us on this journey.

How you can contribute

  • Want to add a new vector DB? Fork the repo and show us what you’ve got
  • Think the RAG prompt can be improved? Open a pull request
  • Have an idea for a new feature? Start a discussion in the issues

We believe that by building in public, we can create a tool that is not only accessible and affordable but also more robust and adaptable, thanks to the collective intelligence of the open-source community.

We built Firestarter with extensibility at its core. Whether you want to swap in your own services, integrate different LLMs, or customize the crawling logic in firestarter.config.ts, we’ve made it easy to adapt the tool to your specific needs.

Get started today

We invite you to explore Firestarter. See what it can do, and then dive into the code to see how it’s done.

We built Firestarter to make custom AI chatbots accessible to everyone. No massive engineering teams required. Just enter a URL and get started.

Let’s build the future of intelligent interfaces. Together.

Ready to Build?

Start scraping web data for your AI apps today.
No credit card needed.

About the Author

Eric Ciarla image
Eric Ciarla@ericciarla

Eric Ciarla is the Chief Operating Officer (COO) of Firecrawl and leads marketing. He also worked on Mendable.ai and sold it to companies like Snapchat, Coinbase, and MongoDB. Previously worked at Ford and Fracta as a Data Scientist. Eric also co-founded SideGuide, a tool for learning code within VS Code with 50,000 users.

More articles by Eric Ciarla

How to Create an llms.txt File for Any Website

Learn how to generate an llms.txt file for any website using the llms.txt Generator and Firecrawl.

Announcing Firestarter, our open source tool that turns any website into a chatbot

Spin up a fully functional RAG chatbot from any website URL using Firecrawl and Upstash—clean markdown in, OpenAI-compatible API out, all in under a minute.

Building Fire Enrich, our open source data enrichment tool

See how we built Fire Enrich, an open source tool that uses Firecrawl, OpenAI, and a multi-agent system to automate data enrichment — fully transparent, extensible, and built for developers.

Cloudflare Error 1015: How to solve it?

Cloudflare Error 1015 is a rate limiting error that occurs when Cloudflare detects that you are exceeding the request limit set by the website owner.

Build an agent that checks for website contradictions

Using Firecrawl and Claude to scrape your website's data and look for contradictions.

Why Companies Need a Data Strategy for Generative AI

Learn why a well-defined data strategy is essential for building robust, production-ready generative AI systems, and discover practical steps for curation, maintenance, and integration.

Getting Started with OpenAI's Predicted Outputs for Faster LLM Responses

A guide to leveraging Predicted Outputs to speed up LLM tasks with GPT-4o models.

How to easily install requests with pip and python

A tutorial on installing the requests library in Python using various methods, with usage examples and troubleshooting tips