purdueglobal.edu
robots.txt
Robots Exclusion Standard data for purdueglobal.edu
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | purdueglobal.edu |
Base Domain | purdueglobal.edu |
Scan Status | Ok |
Last Scan | 2024-09-22T20:54:56+00:00 |
Next Scan | 2024-10-22T20:54:56+00:00 |
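
The Last Scan and Next Scan timestamps can be compared directly, since both are ISO 8601 values with a UTC offset. The sketch below only computes the gap between the two values shown in this report. Whether the scanner always uses this interval is an assumption that a single pair of timestamps cannot confirm.

```python
from datetime import datetime

# Timestamps copied from the Scan Details table.
last_scan = datetime.fromisoformat("2024-09-22T20:54:56+00:00")
next_scan = datetime.fromisoformat("2024-10-22T20:54:56+00:00")

# Subtracting two timezone-aware datetimes yields a timedelta.
interval = next_scan - last_scan
print(interval.days)
```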
Last Scan
Field | Value |
---|---|
Scanned | 2024-09-22T20:54:56+00:00 |
URL | https://purdueglobal.edu/robots.txt |
Redirect | https://www.purdueglobal.edu/robots.txt |
Redirect Domain | www.purdueglobal.edu |
Redirect Base | purdueglobal.edu |
Domain IPs | 13.107.246.59 |
Redirect IPs | 13.107.246.59, 2620:1ec:bdf::59 |
Response IP | 13.107.246.59 |
Found | Yes |
Hash | 86feb5ea768c32ca72b764834e0933cc29d75f5d6f0d1d90e5c40e130869a44e |
SimHash | 45704550e7d3 |
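
The Hash value is 64 hexadecimal characters long, which is the length of a SHA-256 digest. The report does not name the algorithm or say which bytes are hashed, so the sketch below assumes SHA-256 over the raw response body. The `content_hash` helper and the `body` placeholder are hypothetical and do not reproduce the real file, so the digest will not equal the value in the table.

```python
import hashlib


def content_hash(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt response body."""
    return hashlib.sha256(body).hexdigest()


# Hypothetical placeholder; the real bytes would come from the HTTP response.
body = b"User-agent: *\nDisallow: /gao/\n"
digest = content_hash(body)

# A SHA-256 hex digest always has the same length as the Hash field.
print(len(digest))
```

Comparing this digest between scans would show whether the file changed at all. The shorter SimHash field is presumably a similarity fingerprint for detecting near-duplicate content, although the report does not describe how it is computed.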
Groups
*
Rule | Path |
---|---|
Disallow | /uploadedfiles/gao/ |
Disallow | /search-results* |
Disallow | /thank-you* |
Disallow | /*? |
Disallow | /assets/fonts* |
Disallow | /assets/documents/ebooks-guides* |
Disallow | /docs* |
Disallow | /gao/ |
Disallow | /_test/ |
Disallow | /_Test/ |
Disallow | /_status/ |
Disallow | /bot* |
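
To see what these rules block in practice, here is a minimal matcher sketch. It assumes the common Robots Exclusion Protocol semantics: a rule matches as a case-sensitive prefix of the path and query, `*` matches any run of characters, and a trailing `$` anchors the end. The function names `pattern_to_regex` and `is_disallowed` are illustrative and not part of any library. Because this group has no Allow rules, longest-match precedence is not modeled.

```python
import re

# Disallow paths copied from the table for the "*" group.
DISALLOW = [
    "/uploadedfiles/gao/",
    "/search-results*",
    "/thank-you*",
    "/*?",
    "/assets/fonts*",
    "/assets/documents/ebooks-guides*",
    "/docs*",
    "/gao/",
    "/_test/",
    "/_Test/",
    "/_status/",
    "/bot*",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with ".*" where "*" appeared.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path_and_query: str) -> bool:
    """Return True if any Disallow rule matches from the start of the path."""
    return any(pattern_to_regex(p).match(path_and_query) for p in DISALLOW)


for candidate in ["/programs", "/programs?page=2", "/_TEST/x", "/botany"]:
    print(candidate, is_disallowed(candidate))
```

Under these assumptions, `/*?` blocks every URL that carries a query string, `/bot*` also catches unrelated paths such as `/botany`, and the separate `/_test/` and `/_Test/` entries exist because matching is case-sensitive. A trailing `*` as in `/docs*` is redundant with prefix matching, so `/docs` would behave the same way.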
Other Records
Field | Value |
---|---|
sitemap | https://www.purdueglobal.edu/sitemap-page.xml |
sitemap | https://www.purdueglobal.edu/sitemap-pdf.xml |
sitemap | https://www.purdueglobal.edu/sitemap-index.xml |
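
The split between Groups and Other Records can be illustrated with a small parser sketch. The `ROBOTS_TXT` text below is a hypothetical, shortened reconstruction from the tables in this report. The real file's ordering, comments, and capitalization are not shown here, and `parse_robots` is an illustrative helper, not the scanner's code.

```python
from collections import defaultdict

# Hypothetical excerpt rebuilt from the report; not the actual file.
ROBOTS_TXT = """\
User-agent: *
Disallow: /gao/
Disallow: /bot*

Sitemap: https://www.purdueglobal.edu/sitemap-page.xml
Sitemap: https://www.purdueglobal.edu/sitemap-pdf.xml
Sitemap: https://www.purdueglobal.edu/sitemap-index.xml
"""


def parse_robots(text):
    """Split robots.txt text into per-agent rule groups and other records."""
    groups = defaultdict(list)
    other = []
    agents = []
    in_rules = False
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if ":" not in line:
            continue
        field, value = (part.strip() for part in line.split(":", 1))
        field = field.lower()
        if field == "user-agent":
            if in_rules:  # a user-agent line after rules starts a new group
                agents = []
                in_rules = False
            agents.append(value)
        elif field in ("allow", "disallow"):
            in_rules = True
            for agent in agents:
                groups[agent].append((field, value))
        else:
            other.append((field, value))  # e.g. sitemap records
    return dict(groups), other


groups, other = parse_robots(ROBOTS_TXT)
sitemaps = [value for field, value in other if field == "sitemap"]
print(sitemaps)
```

Sitemap records apply to the whole file rather than to a user-agent group, which is why the sketch collects them separately.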