purdueglobal.edu
robots.txt

Robots Exclusion Standard data for purdueglobal.edu

Resource Scan

Scan Details

Site Domain purdueglobal.edu
Base Domain purdueglobal.edu
Scan Status Ok
Last Scan2024-09-22T20:54:56+00:00
Next Scan 2024-10-22T20:54:56+00:00

Last Scan

Scanned2024-09-22T20:54:56+00:00
URL https://purdueglobal.edu/robots.txt
Redirect https://www.purdueglobal.edu/robots.txt
Redirect Domain www.purdueglobal.edu
Redirect Base purdueglobal.edu
Domain IPs 13.107.246.59
Redirect IPs 13.107.246.59, 2620:1ec:bdf::59
Response IP 13.107.246.59
Found Yes
Hash 86feb5ea768c32ca72b764834e0933cc29d75f5d6f0d1d90e5c40e130869a44e
SimHash 45704550e7d3

Groups

*

Rule Path
Disallow /uploadedfiles/gao/
Disallow /search-results*
Disallow /thank-you*
Disallow /*?
Disallow /assets/fonts*
Disallow /assets/documents/ebooks-guides*
Disallow /docs*
Disallow /gao/
Disallow /_test/
Disallow /_Test/
Disallow /_status/
Disallow /bot*

Other Records

Field Value
sitemap https://www.purdueglobal.edu/sitemap-page.xml
sitemap https://www.purdueglobal.edu/sitemap-pdf.xml
sitemap https://www.purdueglobal.edu/sitemap-index.xml