usca.edu
robots.txt
Robots Exclusion Standard data for usca.edu
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | usca.edu |
Base Domain | usca.edu |
Scan Status | Ok |
Last Scan | 2024-09-23T16:36:23+00:00 |
Next Scan | 2024-10-23T16:36:23+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-09-23T16:36:23+00:00 |
URL | https://usca.edu/robots.txt |
Domain IPs | 104.22.38.138, 104.22.39.138, 172.67.38.134, 2606:4700:10::6816:268a, 2606:4700:10::6816:278a, 2606:4700:10::ac43:2686 |
Response IP | 104.22.38.138 |
Found | Yes |
Hash | 968eb209d4f42593792d1aad5d682f6a76f37cce9da45f425c4db0bb514605e2 |
SimHash | 6019dc1067d3 |
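The Hash value above is 64 hexadecimal characters, which is the length of a SHA-256 digest. The report does not state which algorithm or which bytes the scanner hashes, so the following is only a sketch, assuming SHA-256 over the raw response body, of how such a fingerprint could be used to detect changes between scans. The function names `content_fingerprint` and `has_changed` are hypothetical.

```python
import hashlib


def content_fingerprint(body: bytes) -> str:
    """Return a hex digest of the raw robots.txt body.

    Assumption: SHA-256 is used; the scan report does not name the algorithm.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(body: bytes, previous_hash: str) -> bool:
    """Compare a freshly fetched body against a previously stored digest."""
    return content_fingerprint(body) != previous_hash.lower()
```

A scanner built this way would store the digest after each scan and re-parse the file only when `has_changed` returns `True`.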
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | / |
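
A single `Disallow: /` rule in the `*` group asks every crawler that has no more specific group to stay away from the whole site. The sketch below reproduces that group in memory and checks it with Python's standard `urllib.robotparser`. The user agent name `ExampleBot` and the path `/admissions/index.html` are made up for illustration and do not come from the scan.

```python
from urllib.robotparser import RobotFileParser

# The one group reported by the scan, written out as robots.txt lines.
ROBOTS_LINES = [
    "User-agent: *",
    "Disallow: /",
]


def is_allowed(user_agent: str, url: str) -> bool:
    """Return True if the rules above permit user_agent to fetch url."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_LINES)  # parse in-memory lines; no network access
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # Hypothetical crawler name and URLs, used only to exercise the rule.
    for url in ("https://usca.edu/", "https://usca.edu/admissions/index.html"):
        print(url, is_allowed("ExampleBot", url))
```

Because every URL path begins with `/`, both checks are refused under this rule set.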
Other Records
Field | Value |
---|---|
sitemap | https://uscaedu-lb01-production.terminalfour.net/sitemap.xml /* It needs to point to the Google sitemap as per the build*/ |
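
The sitemap value recorded above carries trailing `/* ... */` text after the URL. robots.txt comments start with `#`, so a parser may treat that text as part of the value instead of discarding it. As a minimal sketch, assuming the URL is the first whitespace-separated token, a consumer could recover it as follows. The helper name `extract_sitemap_url` is hypothetical.

```python
from typing import Optional
from urllib.parse import urlparse

# The raw value exactly as reported in the "Other Records" table.
RAW_SITEMAP_VALUE = (
    "https://uscaedu-lb01-production.terminalfour.net/sitemap.xml "
    "/* It needs to point to the Google sitemap as per the build*/"
)


def extract_sitemap_url(raw_value: str) -> Optional[str]:
    """Return the leading http(s) URL from a sitemap record, or None."""
    tokens = raw_value.split()
    if not tokens:
        return None
    candidate = tokens[0]
    parsed = urlparse(candidate)
    # Accept only absolute http(s) URLs that include a host.
    if parsed.scheme in ("http", "https") and parsed.netloc:
        return candidate
    return None
```

Taking only the first token is a deliberate simplification: it drops the trailing note but would also truncate a URL that contained unencoded spaces.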