content.rapha.cc
robots.txt

Robots Exclusion Standard data for content.rapha.cc

Resource Scan

Scan Details

Site Domain content.rapha.cc
Base Domain rapha.cc
Scan Status Ok
Last Scan2025-07-01T14:28:58+00:00
Next Scan 2025-07-31T14:28:58+00:00

Last Scan

Scanned2025-07-01T14:28:58+00:00
URL https://content.rapha.cc/robots.txt
Domain IPs 76.76.21.123, 76.76.21.22
Response IP 66.33.60.66
Found Yes
Hash 19c3ed7e9ac4475852c7732d6b6adeecd0a182b0a160f72fd35c1853538b2b1e
SimHash 42519c578590

Groups

*

Rule Path
Allow /

googlebot

Rule Path
Disallow /account
Disallow /login
Disallow /stories/*

Other Records

Field Value
sitemap https://content.rapha.cc/sitemap.xml

Comments

  • *
  • Googlebot
  • Host
  • Sitemaps

Warnings

  • `host` is not a known field.