Why can't my Codex CLI Agent browse the web?

Codex CLI agents can search the web by default, but the default mode is cached: results come from an OpenAI-maintained, pre-indexed snapshot rather than from live pages. Earlier versions shipped with web search disabled, which is why many developers still add config flags that are no longer needed. To get live results today, set web_search = "live" in ~/.codex/config.toml or pass --search for a single run. Even in live mode, the built-in search returns snippets only and does not scrape page content, so the agent can see that a page exists but cannot read it.
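As a minimal sketch of the live-mode setting described above, the config file might look like this. The key name and value come from the text; placing it at the top level of the file is an assumption, so check it against your installed Codex version.

```toml
# ~/.codex/config.toml
# Assumption: web_search is a top-level key, not nested under a table.
# "live" asks for fresh results instead of the cached, pre-indexed snapshot.
web_search = "live"
```

For a one-off run, passing --search on the command line should give the same behavior without editing the file.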

| Capability | Codex cached (default) | Codex live (web_search = "live") | Firecrawl |
| --- | --- | --- | --- |
| Web search | Yes, pre-indexed snippets | Yes, live snippets | Yes, full results |
| Read page content | No | No | Yes, via scrape |
| Multi-page crawl | No | No | Yes |
| Structured extraction | No | No | Yes |
| Agent-optimized index | No | No | Yes |

Use Codex cached search for lightweight lookups where a pre-indexed snippet is enough and you want no external dependencies. Switch to live mode when your agent needs current data. Switch to Firecrawl when your agent needs to read the full content of a page, follow links across a site, or extract structured data reliably.

Firecrawl is built around an agent-first web index that serves clean, LLM-ready content rather than raw HTML, so agents get higher-quality results without post-processing. Pair it with Codex via the Firecrawl MCP or CLI to give your agent full web access in a few lines of config.
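A rough sketch of what that config could look like for the MCP route follows. The table name firecrawl, the firecrawl-mcp package, the FIRECRAWL_API_KEY variable, and the placeholder key are all assumptions for illustration, not details confirmed by this page; verify them against the current Codex and Firecrawl documentation before use.

```toml
# ~/.codex/config.toml
# Hypothetical MCP server entry; names and values below are assumptions.
[mcp_servers.firecrawl]
command = "npx"                                   # launches the server through Node's package runner
args = ["-y", "firecrawl-mcp"]                    # assumed package name for the Firecrawl MCP server
env = { FIRECRAWL_API_KEY = "fc-YOUR_API_KEY" }   # placeholder; substitute your own key
```

With an entry along these lines, the agent would call Firecrawl's search and scrape tools through MCP instead of relying only on the built-in snippet search.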

Last updated: May 06, 2026