wgu.edu
robots.txt

Robots Exclusion Standard data for wgu.edu

Resource Scan

Scan Details

Site Domain wgu.edu
Base Domain wgu.edu
Scan Status Ok
Last Scan 2024-09-14T20:17:55+00:00
Next Scan 2024-10-14T20:17:55+00:00

Last Scan

Scanned 2024-09-14T20:17:55+00:00
URL https://wgu.edu/robots.txt
Redirect https://www.wgu.edu/robots.txt
Redirect Domain www.wgu.edu
Redirect Base wgu.edu
Domain IPs 151.101.194.224
Redirect IPs 151.101.130.224, 151.101.194.224, 151.101.2.224, 151.101.66.224
Response IP 199.232.46.224
Found Yes
Hash d554a429570d8ce30aff2ecedff57a1567c48920d1f1fe887aaf0187a59e724a
SimHash 081d8f068bc0
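
The Hash value above is 64 hexadecimal characters, which is consistent with a SHA-256 digest, although the report does not name the algorithm. As a minimal sketch under that assumption, a fetched robots.txt body could be fingerprinted and compared against a recorded value as shown here. The function names `body_digest` and `matches_recorded` are hypothetical, and the SimHash field is not covered because its algorithm and parameters are not stated.

```python
import hashlib


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a response body.

    Assumption: the report's Hash field is SHA-256 over the raw bytes.
    The report itself does not say so.
    """
    return hashlib.sha256(body).hexdigest()


def matches_recorded(body: bytes, recorded_hex: str) -> bool:
    """Compare a freshly computed digest with a recorded one, ignoring case."""
    return body_digest(body) == recorded_hex.strip().lower()
```

A changed digest between two scans would indicate that the file content changed, whatever the exact hashing scheme the scanner uses.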

Groups

*

Rule Path
Disallow /etc/segmentation
Disallow /*jcr%3Acontent
Disallow /bin
Disallow /content/wgu-marketing/en/search.html
Disallow /search.html
Disallow /content/wgu-marketing/en/tools.html
Disallow /content/wgu-marketing/en/tools
Disallow /tools.html
Disallow /tools
Disallow *.print.html
Disallow *.frame.html
Disallow /content/wgu-shared
Disallow cm.wgu.edu
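
Several of the rules above use the `*` wildcard, which plain prefix matching does not handle. The following is a minimal sketch of how such rules could be evaluated, assuming the common convention that `*` matches any run of characters and a trailing `$` anchors the end of the path. The helper names `rule_to_regex` and `is_disallowed` are hypothetical, the test paths are made up for illustration, and the sketch does no percent-encoding normalization.

```python
import re

# Disallow paths copied from the "*" group listed above.
DISALLOW = [
    "/etc/segmentation",
    "/*jcr%3Acontent",
    "/bin",
    "/content/wgu-marketing/en/search.html",
    "/search.html",
    "/content/wgu-marketing/en/tools.html",
    "/content/wgu-marketing/en/tools",
    "/tools.html",
    "/tools",
    "*.print.html",
    "*.frame.html",
    "/content/wgu-shared",
    "cm.wgu.edu",  # no leading "/", so it never matches a URL path here
]


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate a robots.txt path rule into a compiled regular expression."""
    anchored = rule.endswith("$")
    body = rule[:-1] if anchored else rule
    # Escape the literal pieces and join them with ".*" for each "*".
    pattern = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(pattern + ("$" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """True if any rule matches from the start of the path.

    The group contains only Disallow rules, so no Allow/Disallow
    precedence logic is needed in this sketch.
    """
    return any(rule_to_regex(rule).match(path) for rule in DISALLOW)
```

For example, `is_disallowed("/tools.html")` and `is_disallowed("/about/page.print.html")` are both true under these assumptions, while `is_disallowed("/")` is false.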

baiduspider

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /
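
To illustrate how a crawler selects a group, the sketch below feeds a reconstructed excerpt of the groups above into Python's standard `urllib.robotparser`. The `User-agent:` / `Disallow:` lines are an assumption about how the original file is written, since the report only lists parsed values. Only two non-wildcard rules from the `*` group are included because `urllib.robotparser` does prefix matching and does not interpret `*` inside paths. `ExampleBot` and the helper `allowed` are hypothetical names.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed excerpt; the exact text of the real file is not shown in the report.
ROBOTS_LINES = [
    "User-agent: *",
    "Disallow: /bin",
    "Disallow: /search.html",
    "",
    "User-agent: baiduspider",
    "Disallow: /",
    "",
    "User-agent: yandexbot",
    "Disallow: /",
]

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)


def allowed(agent: str, path: str) -> bool:
    """Ask the parser whether the given user agent may fetch the path."""
    return parser.can_fetch(agent, "https://www.wgu.edu" + path)
```

With this input, `allowed("baiduspider", "/")` and `allowed("yandexbot", "/about")` return `False`, while an agent with no group of its own falls back to the `*` group, so `allowed("ExampleBot", "/")` returns `True` and `allowed("ExampleBot", "/bin/x")` returns `False`.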

Other Records

Field Value
sitemap https://www.wgu.edu/sitemap.xml
sitemap https://www.wgu.edu/bin/wgu-65/api/sitemap.xml
sitemap https://www.wgu.edu/heyteach/sitemap.xml
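
The sitemap records can be read back with the standard library as well. This is a small sketch, assuming the file declares them with the usual `Sitemap:` field and a Python version that provides `RobotFileParser.site_maps()` (3.8 or later). The variable names are illustrative.

```python
from urllib.robotparser import RobotFileParser

# Sitemap lines reconstructed from the records listed above.
SITEMAP_LINES = [
    "Sitemap: https://www.wgu.edu/sitemap.xml",
    "Sitemap: https://www.wgu.edu/bin/wgu-65/api/sitemap.xml",
    "Sitemap: https://www.wgu.edu/heyteach/sitemap.xml",
]

sitemap_parser = RobotFileParser()
sitemap_parser.parse(SITEMAP_LINES)

# site_maps() returns the declared URLs in file order, or None if there are none.
sitemap_urls = sitemap_parser.site_maps()
```

Sitemap records are independent of user-agent groups, so they apply to every crawler regardless of which group it matches.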