cml.org
robots.txt

Robots Exclusion Standard data for cml.org

Resource Scan

Scan Details

Site Domain cml.org
Base Domain cml.org
Scan Status Ok
Last Scan2025-10-14T19:33:57+00:00
Next Scan 2025-11-13T19:33:57+00:00

Last Scan

Scanned2025-10-14T19:33:57+00:00
URL https://cml.org/robots.txt
Redirect https://www.cml.org:443/robots.txt
Redirect Domain www.cml.org
Redirect Base cml.org
Domain IPs 3.220.41.178
Redirect IPs 18.208.242.254, 35.170.22.248
Response IP 18.208.242.254
Found Yes
Hash 19ecd75c1640d90895e2a85801f26a37eb63510a837060fc76472049434f95f4
SimHash 6915d9754ff0

Groups

*

Rule Path
Disallow /Sitefinity
Disallow /sandbox
Disallow /search-results

Other Records

Field Value
crawl-delay 120

Comments

  • Do not delete /Sitefinity. Never any reason to allow indexing here
  • The same goes for sandbox
  • Also disallow search. We already have it set to "noindex", but keep getting googlebot hits