santacruzmah.org
robots.txt
Robots Exclusion Standard data for santacruzmah.org
Resource Scan
Scan Details
Site Domain | santacruzmah.org |
Base Domain | santacruzmah.org |
Scan Status | Ok |
Last Scan | 2025-06-03T19:56:39+00:00 |
Next Scan | 2025-06-17T19:56:39+00:00 |
Last Scan
Scanned | 2025-06-03T19:56:39+00:00 |
URL | https://santacruzmah.org/robots.txt |
Redirect | https://www.santacruzmah.org/robots.txt |
Redirect Domain | www.santacruzmah.org |
Redirect Base | santacruzmah.org |
Domain IPs | 104.21.10.117, 172.67.163.39, 2606:4700:3030::ac43:a327, 2606:4700:3034::6815:a75 |
Redirect IPs | 104.21.10.117, 172.67.163.39, 2606:4700:3030::ac43:a327, 2606:4700:3034::6815:a75 |
Response IP | 104.21.10.117 |
Found | Yes |
Hash | 508c5a6d61f4678a83ec6ef2dee20bf986123354f08ac2f43413275238021e03 |
SimHash | 4c509b52b7d2 |
Groups
*
Rule | Path |
---|---|
Disallow | /cpresources/ |
Disallow | /vendor/ |
Disallow | /.env |
Disallow | /test-visualforce |
Disallow | /test-visualforce-test |
Other Records
Field | Value |
---|---|
sitemap | https://www.santacruzmah.org/sitemaps-1-sitemap.xml |
Comments