hope.edu
robots.txt

Robots Exclusion Standard data for hope.edu

Archived Snapshots

Resource Scan

Scan Details

Site Domain	hope.edu
Base Domain	hope.edu
Scan Status	Ok
Last Scan	2024-10-28T15:34:47+00:00
Next Scan	2024-11-27T15:34:47+00:00

Last Scan

Scanned	2024-10-28T15:34:47+00:00
URL	https://hope.edu/robots.txt
Domain IPs	209.140.194.21
Response IP	209.140.194.21
Found	Yes
Hash	0f367ef1e10f0ecead511e844ed658acd22cb4ea72583729692b603746ec084c
SimHash	a4800d0c1792

Groups

*

Rule	Path
Disallow	/sitemap-generator.html
Disallow	/_resources/
Disallow	/_offices/
Disallow	/_academics/
Disallow	/_showcase/
Disallow	/_training/
Disallow	/_dev/
Disallow	/_mh-dev/
Disallow	/email/
Disallow	/catalog/current/majors-minors/index.html
Disallow	/catalog/working/
Disallow	/*.xml$
Disallow	/*.inc$
Disallow	/*.php$
Disallow	/*.txt$
Disallow	/*_props.html$
Disallow	/offices/computing-information-technology/wi-fi.html
Disallow	/admissions/niche-direct-admissions.html
Allow	/sitemap.xml
Allow	/data/htdocs/sitemap.xml

Rule

Path

Disallow

/sitemap-generator.html

Disallow

/_resources/

Disallow

/_offices/

Disallow

/_academics/

Disallow

/_showcase/

Disallow

/_training/

Disallow

/_dev/

Disallow

/_mh-dev/

Disallow

/email/

Disallow

/catalog/current/majors-minors/index.html

Disallow

/catalog/working/

Disallow

/*.xml$

Disallow

/*.inc$

Disallow

/*.php$

Disallow

/*.txt$

Disallow

/*_props.html$

Disallow

/offices/computing-information-technology/wi-fi.html

Disallow

/admissions/niche-direct-admissions.html

Allow

/sitemap.xml

Allow

/data/htdocs/sitemap.xml

Back to top

Other Records

Field	Value
sitemap	https://hope.edu/sitemap.xml

Field

Value

sitemap

https://hope.edu/sitemap.xml

Back to top

hope.edurobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

Other Records

hope.edu
robots.txt