curiosus.app
robots.txt

Robots Exclusion Standard data for curiosus.app

Resource Scan

Scan Details

Site Domain curiosus.app
Base Domain curiosus.app
Scan Status Ok
Last Scan2025-12-10T22:13:35+00:00
Next Scan 2025-12-17T22:13:35+00:00

Last Scan

Scanned2025-12-10T22:13:35+00:00
URL https://curiosus.app/robots.txt
Domain IPs 185.22.110.122
Response IP 185.22.110.122
Found Yes
Hash 2803ece8a6c9ce60d5621369e4b8384d30a670a54a9614524063d9024b2ed6f3
SimHash 66128811e5a5

Groups

*

Rule Path
Allow /

Other Records

Field Value
crawl-delay 1

googlebot

Rule Path
Allow /

bingbot

Rule Path
Allow /

slurp

Rule Path
Allow /

duckduckbot

Rule Path
Allow /

gptbot

Rule Path
Allow /

chatgpt-user

Rule Path
Allow /

ccbot

Rule Path
Allow /

anthropic-ai

Rule Path
Allow /

claude-web

Rule Path
Allow /

google-extended

Rule Path
Allow /

Other Records

Field Value
sitemap https://curiosus.app/sitemap.xml

Comments

  • robots.txt for Curiosus
  • Sitemap location
  • Crawl-delay for polite bots
  • Allow all major search engines and AI crawlers
  • AI Crawlers