hope.edu
robots.txt

Robots Exclusion Standard data for hope.edu

Resource Scan

Scan Details

Site Domain hope.edu
Base Domain hope.edu
Scan Status Ok
Last Scan2024-10-28T15:34:47+00:00
Next Scan 2024-11-27T15:34:47+00:00

Last Scan

Scanned2024-10-28T15:34:47+00:00
URL https://hope.edu/robots.txt
Domain IPs 209.140.194.21
Response IP 209.140.194.21
Found Yes
Hash 0f367ef1e10f0ecead511e844ed658acd22cb4ea72583729692b603746ec084c
SimHash a4800d0c1792

Groups

*

Rule Path
Disallow /sitemap-generator.html
Disallow /_resources/
Disallow /_offices/
Disallow /_academics/
Disallow /_showcase/
Disallow /_training/
Disallow /_dev/
Disallow /_mh-dev/
Disallow /email/
Disallow /catalog/current/majors-minors/index.html
Disallow /catalog/working/
Disallow /*.xml$
Disallow /*.inc$
Disallow /*.php$
Disallow /*.txt$
Disallow /*_props.html$
Disallow /offices/computing-information-technology/wi-fi.html
Disallow /admissions/niche-direct-admissions.html
Allow /sitemap.xml
Allow /data/htdocs/sitemap.xml

Other Records

Field Value
sitemap https://hope.edu/sitemap.xml