wgu.edu
robots.txt
Robots Exclusion Standard data for wgu.edu
Resource Scan
Scan Details
Site Domain | wgu.edu |
Base Domain | wgu.edu |
Scan Status | Ok |
Last Scan | 2024-09-14T20:17:55+00:00 |
Next Scan | 2024-10-14T20:17:55+00:00 |
Last Scan
Scanned | 2024-09-14T20:17:55+00:00 |
URL | https://wgu.edu/robots.txt |
Redirect | https://www.wgu.edu/robots.txt |
Redirect Domain | www.wgu.edu |
Redirect Base | wgu.edu |
Domain IPs | 151.101.194.224 |
Redirect IPs | 151.101.130.224, 151.101.194.224, 151.101.2.224, 151.101.66.224 |
Response IP | 199.232.46.224 |
Found | Yes |
Hash | d554a429570d8ce30aff2ecedff57a1567c48920d1f1fe887aaf0187a59e724a |
SimHash | 081d8f068bc0 |
Groups
*
Rule | Path |
---|---|
Disallow | /etc/segmentation |
Disallow | /*jcr%3Acontent |
Disallow | /bin |
Disallow | /content/wgu-marketing/en/search.html |
Disallow | /search.html |
Disallow | /content/wgu-marketing/en/tools.html |
Disallow | /content/wgu-marketing/en/tools |
Disallow | /tools.html |
Disallow | /tools |
Disallow | *.print.html |
Disallow | *.frame.html |
Disallow | /content/wgu-shared |
Disallow | cm.wgu.edu |
Other Records
Field | Value |
---|---|
sitemap | https://www.wgu.edu/sitemap.xml |
sitemap | https://www.wgu.edu/bin/wgu-65/api/sitemap.xml |
sitemap | https://www.wgu.edu/heyteach/sitemap.xml |