santacruzmah.org
robots.txt
Robots Exclusion Standard data for santacruzmah.org
Resource Scan
Scan Details
| Site Domain | santacruzmah.org |
| Base Domain | santacruzmah.org |
| Scan Status | Ok |
| Last Scan | 2026-01-06T08:33:57+00:00 |
| Next Scan | 2026-01-13T08:33:57+00:00 |
Last Scan
| Scanned | 2026-01-06T08:33:57+00:00 |
| URL | https://santacruzmah.org/robots.txt |
| Redirect | https://www.santacruzmah.org/robots.txt |
| Redirect Domain | www.santacruzmah.org |
| Redirect Base | santacruzmah.org |
| Domain IPs | 104.21.10.117, 172.67.163.39, 2606:4700:3030::ac43:a327, 2606:4700:3034::6815:a75 |
| Redirect IPs | 104.21.10.117, 172.67.163.39, 2606:4700:3030::ac43:a327, 2606:4700:3034::6815:a75 |
| Response IP | 172.67.163.39 |
| Found | Yes |
| Hash | 508c5a6d61f4678a83ec6ef2dee20bf986123354f08ac2f43413275238021e03 |
| SimHash | 4c509b52b7d2 |
Groups
*
| Rule | Path |
|---|---|
| Disallow | /cpresources/ |
| Disallow | /vendor/ |
| Disallow | /.env |
| Disallow | /test-visualforce |
| Disallow | /test-visualforce-test |
Other Records
| Field | Value |
|---|---|
| sitemap | https://www.santacruzmah.org/sitemaps-1-sitemap.xml |
Comments