# /news robots policy: Class B (allow search + answer engines, block training-only)

# Traditional search
User-agent: Googlebot
Allow: /

User-agent: Googlebot-Image
Allow: /

User-agent: Googlebot-News
Allow: /

User-agent: GoogleOther
Allow: /

User-agent: Google-CloudVertexBot
Allow: /

User-agent: Bingbot
Allow: /

User-agent: BingPreview
Allow: /

User-agent: msnbot
Allow: /

User-agent: Applebot
Allow: /

User-agent: DuckDuckBot
Allow: /

User-agent: Slurp
Allow: /

User-agent: YandexBot
Allow: /

User-agent: Baiduspider
Allow: /

# AI answer engines (user-triggered retrieval, cites sources)
User-agent: OAI-SearchBot
Allow: /

User-agent: ChatGPT-User
Allow: /

User-agent: Perplexity-User
Allow: /

User-agent: DuckAssistBot
Allow: /

User-agent: YouBot
Allow: /

User-agent: KagiBot
Allow: /

User-agent: MistralAI-User
Allow: /

User-agent: Claude-User
Allow: /

# Block training-only crawlers
User-agent: GPTBot
Disallow: /

User-agent: Google-Extended
Disallow: /

User-agent: CCBot
Disallow: /

User-agent: ClaudeBot
Disallow: /

User-agent: anthropic-ai
Disallow: /

User-agent: claude-web
Disallow: /

User-agent: Bytespider
Disallow: /

User-agent: Amazonbot
Disallow: /

User-agent: Applebot-Extended
Disallow: /

User-agent: Meta-ExternalAgent
Disallow: /

User-agent: FacebookBot
Disallow: /

User-agent: cohere-ai
Disallow: /

User-agent: Diffbot
Disallow: /

User-agent: omgili
Disallow: /

User-agent: PerplexityBot
Disallow: /

# Default
User-agent: *
Allow: /

Sitemap: https://aipulled.com/news/sitemap.xml