rocketspark.com
robots.txt

Robots Exclusion Standard data for rocketspark.com

Resource Scan

Scan Details

Site Domain rocketspark.com
Base Domain rocketspark.com
Scan Status Ok
Last Scan2025-08-16T17:45:22+00:00
Next Scan 2025-09-15T17:45:22+00:00

Last Scan

Scanned2025-08-16T17:45:22+00:00
URL https://rocketspark.com/robots.txt
Redirect https://www.rocketspark.com/robots.txt
Redirect Domain www.rocketspark.com
Redirect Base rocketspark.com
Domain IPs 104.20.18.26, 172.66.166.232, 2606:4700:10::6814:121a, 2606:4700:10::ac42:a6e8
Redirect IPs 104.20.18.26, 172.66.166.232, 2606:4700:10::6814:121a, 2606:4700:10::ac42:a6e8
Response IP 104.20.18.26
Found Yes
Hash b4a36b9686a67a1bb754ed9fb3bd0e216d0e9a2fff89dc88973aea8a0d6b4f4d
SimHash 7d155900aed5

Groups

*

Rule Path
Disallow /admin$
Disallow /admin/
Disallow /users$
Disallow /users/
Disallow /tools$
Disallow /tools/
Disallow /*.php$
Allow /

oai-searchbot

Rule Path
Allow /

gptbot

Rule Path
Allow /

chatgpt-user

Rule Path
Allow /

chatgpt-user/2.0

Rule Path
Allow /

claudebot

Rule Path
Allow /

claude-web

Rule Path
Allow /

claude-searchbot

Rule Path
Allow /

claude-user

Rule Path
Allow /

anthropic-ai

Rule Path
Allow /

perplexitybot

Rule Path
Allow /

perplexity-user

Rule Path
Allow /

googlebot

Rule Path
Allow /

google-extended

Rule Path
Allow /

google-cloudvertexbot

Rule Path
Allow /

bingbot

Rule Path
Allow /

ccbot

Rule Path
Allow /

duckassistbot

Rule Path
Allow /

youbot

Rule Path
Allow /

amazonbot

Rule Path
Allow /

meta-externalagent

Rule Path
Allow /

meta-externalfetcher

Rule Path
Allow /

Other Records

Field Value
sitemap https://www.rocketspark.com/sitemap.xml
sitemap https://www.rocketspark.com/au/sitemap.xml
sitemap https://www.rocketspark.com/nz/sitemap.xml
sitemap https://www.rocketspark.com/uk/sitemap.xml
sitemap https://www.rocketspark.com/us/sitemap.xml

Comments

  • OpenAI Crawlers
  • Anthropic (Claude) Crawlers
  • Perplexity Crawlers
  • Google AI Crawlers
  • Other Answer Engine Crawlers
  • Meta AI Crawlers