AI Crawler
OAI-SearchBot
OpenAI's user-agent for ChatGPT Search retrieval (distinct from GPTBot, which is for training). Allowing this is required for inclusion in ChatGPT Search results.
What it is
OAI-SearchBot is the user-agent OpenAI uses to retrieve and rank pages for ChatGPT Search results, distinct from GPTBot which gathers training data. It fetches content specifically to populate live search answers and their source links.
Why it matters
Inclusion in ChatGPT Search depends on allowing OAI-SearchBot, so blocking it removes your domain from that answer surface even if GPTBot is permitted. Treating the two as interchangeable is a common cause of lost search visibility.
How it works
Configure it independently in robots.txt by naming the OAI-SearchBot user-agent with Allow or Disallow rules. Allowing OAI-SearchBot while controlling GPTBot lets you appear in ChatGPT Search without committing content to training.
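The independent configuration described above can be checked before deployment. The following is a minimal sketch, assuming a hypothetical robots.txt that allows OAI-SearchBot and disallows GPTBot; the helper name is_allowed and the example.com URL are illustrative, not part of any OpenAI tooling. It uses Python's standard urllib.robotparser, whose rule matching may differ in edge cases from how a given crawler interprets the file.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: permit search retrieval, opt out of training.
ROBOTS_TXT = """\
User-agent: OAI-SearchBot
Allow: /

User-agent: GPTBot
Disallow: /
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the given robots.txt text lets user_agent fetch url."""
    parser = RobotFileParser()
    # parse() accepts an iterable of lines, so no network request is made.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    page = "https://example.com/pricing"  # illustrative URL
    for agent in ("OAI-SearchBot", "GPTBot"):
        print(agent, is_allowed(ROBOTS_TXT, agent, page))
```

Running the same check against the live robots.txt after each deployment is one way to catch a rule that blocks OAI-SearchBot by accident.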
When it applies
Allow OAI-SearchBot when you want eligibility in ChatGPT Search results; block it only if you want to be excluded from that surface entirely.
Examples
- robots.txt: a User-agent: OAI-SearchBot line followed by Allow: / keeps the site eligible for ChatGPT Search
- robots.txt combining Allow for OAI-SearchBot with Disallow for GPTBot to permit search but opt out of training
- Server log shows OAI-SearchBot fetching a page shortly before it appears as a ChatGPT Search source
How it is measured
- Daily fetch count from the OAI-SearchBot user-agent
- Share of priority pages fetched by OAI-SearchBot versus total eligible pages
- Correlation between OAI-SearchBot visits and appearance as a ChatGPT Search citation
- HTTP status distribution returned to OAI-SearchBot, watching for inadvertent blocks
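Several of these measurements can be derived from ordinary web server logs. The sketch below assumes the common "combined" access-log format and matches the crawler by a substring of the user-agent field; the function name summarize, the sample log lines, and the priority paths are all made up for illustration. It covers daily fetch counts, priority-page coverage, and status distribution. The correlation with citations needs citation data that logs do not contain, so it is left out.

```python
import re
from collections import Counter

# Assumes the "combined" access-log format; adjust the pattern for other formats.
LOG_PATTERN = re.compile(
    r'\S+ \S+ \S+ \[(?P<day>[^:]+):[^\]]+\] '
    r'"(?:\S+) (?P<path>\S+)[^"]*" (?P<status>\d{3}) \S+ '
    r'"[^"]*" "(?P<agent>[^"]*)"'
)


def summarize(log_lines, priority_paths):
    """Summarize OAI-SearchBot activity found in access-log lines."""
    daily = Counter()      # fetches per calendar day
    statuses = Counter()   # HTTP status codes returned to the crawler
    fetched = set()        # distinct paths the crawler requested
    for line in log_lines:
        m = LOG_PATTERN.match(line)
        if not m or "OAI-SearchBot" not in m["agent"]:
            continue
        daily[m["day"]] += 1
        statuses[m["status"]] += 1
        fetched.add(m["path"])
    priority = set(priority_paths)
    coverage = len(fetched & priority) / len(priority) if priority else 0.0
    return {
        "daily_fetches": dict(daily),
        "status_counts": dict(statuses),
        "priority_coverage": coverage,
    }


# Made-up sample lines; the user-agent strings are simplified placeholders.
SAMPLE_LOG = [
    '203.0.113.5 - - [10/Mar/2025:08:15:02 +0000] "GET /pricing HTTP/1.1" 200 5120 "-" "OAI-SearchBot/1.0 (illustrative)"',
    '203.0.113.5 - - [10/Mar/2025:08:15:09 +0000] "GET /docs HTTP/1.1" 403 312 "-" "OAI-SearchBot/1.0 (illustrative)"',
    '203.0.113.5 - - [11/Mar/2025:09:01:44 +0000] "GET /pricing HTTP/1.1" 200 5120 "-" "OAI-SearchBot/1.0 (illustrative)"',
    '198.51.100.7 - - [11/Mar/2025:09:02:10 +0000] "GET /blog HTTP/1.1" 200 2048 "-" "GPTBot/1.0 (illustrative)"',
]

if __name__ == "__main__":
    print(summarize(SAMPLE_LOG, ["/pricing", "/docs", "/about", "/contact"]))
```

A rising share of 403 or 5xx responses in status_counts is the kind of inadvertent block the last measurement is meant to surface. Because user-agent strings can be spoofed, counts based on the string alone are best treated as an upper bound.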
Related terms in AI Crawler
- ClaudeBot: Anthropic's web crawler. Powers Claude's retrieval and training. Honours robots.txt; access controls are increasingly material to citation share.
- Google-Extended: Google's opt-out signal for using your content in Bard, Gemini, and AI-powered Search features without affecting classical Search ranking. A separate lever from Googlebot.
- GPTBot: OpenAI's web crawler, used to gather training data. Can be allowed or disallowed in robots.txt independently of OAI-SearchBot; blocking it does not by itself remove a site from ChatGPT Search.
- PerplexityBot: Perplexity's crawler. Allows real-time retrieval for Perplexity answers. Blocking reduces eligibility for Perplexity citation.