my.gwu.edu
robots.txt

Robots Exclusion Standard data for my.gwu.edu

Resource Scan

Scan Details

Site Domain my.gwu.edu
Base Domain gwu.edu
Scan Status Ok
Last Scan2025-10-30T01:08:56+00:00
Next Scan 2025-11-29T01:08:56+00:00

Last Scan

Scanned2025-10-30T01:08:56+00:00
URL https://my.gwu.edu/robots.txt
Domain IPs 128.164.216.43
Response IP 128.164.216.43
Found Yes
Hash 1f5f48d3d9914ee2408f609f7ce831c39970c81661f31c400105dc0d7b0c591e
SimHash 8d1c8261d3f0

Groups

*

Rule Path
Allow /$
Allow /?tab=
Allow /mod/directory
Allow /mod/directory/
Allow /mod/exam_schedules
Allow /mod/exam_schedules/
Allow /mod/pws
Allow /mod/pws/
Allow /mod/101things
Allow /mod/101things/
Allow /mod/active_fit
Allow /mod/active_fit/
Allow /mod/fellowships
Allow /mod/fellowships/
Allow /mod/esiagss
Allow /mod/esiagss/
Allow /mod/kacif
Allow /mod/kacif/
Allow /mod/links
Allow /mod/links/
Allow /mod/gwid
Allow /mod/gwid/
Allow /mod/spiders
Allow /mod/spiders/
Allow /mod/expertfinder
Allow /mod/expertfinder/
Disallow /

Comments

  • myGW robots.txt
  • last updated Nov 2022
  • sections that we want to allow
  • disallow everything else