collegegazette.com
robots.txt
Robots Exclusion Standard data for collegegazette.com
Resource Scan
Scan Details
Site Domain | collegegazette.com |
Base Domain | collegegazette.com |
Scan Status | Ok |
Last Scan | 2024-10-01T18:59:08+00:00 |
Next Scan | 2024-10-08T18:59:08+00:00 |
Last Scan
Scanned | 2024-10-01T18:59:08+00:00 |
URL | https://collegegazette.com/robots.txt |
Domain IPs | 104.16.150.108, 104.16.151.108, 2606:4700::6810:966c, 2606:4700::6810:976c |
Response IP | 104.16.150.108 |
Found | Yes |
Hash | 388a5e393a08124f0e3b9013164d3b4e8aff484884547ef65928949f15fc89d1 |
SimHash | 896c4880ec92 |
Other Records
Field | Value |
---|---|
sitemap | https://collegegazette.com/sitemap_index.xml |
Comments