newhaven.edu
robots.txt

Robots Exclusion Standard data for newhaven.edu

Resource Scan

Scan Details

Site Domain newhaven.edu
Base Domain newhaven.edu
Scan Status Ok
Last Scan2024-10-31T18:29:00+00:00
Next Scan 2024-11-30T18:29:00+00:00

Last Scan

Scanned2024-10-31T18:29:00+00:00
URL https://newhaven.edu/robots.txt
Redirect https://www.newhaven.edu/robots.txt
Redirect Domain www.newhaven.edu
Redirect Base newhaven.edu
Domain IPs 35.153.150.220, 54.197.98.184
Redirect IPs 35.153.150.220, 54.197.98.184
Response IP 54.197.98.184
Found Yes
Hash a05ce3770c52279b18effa1d2590aafdc445bf73a5d6900bef7a631060c02587
SimHash 400515c14dd2

Groups

*

Rule Path
Disallow /__unpublished-save-for-later/
Disallow /*.pcf$
Disallow /*.inc$
Disallow /*_nav.php$
Disallow /includes

Other Records

Field Value
sitemap https://www.newhaven.edu/sitemap.xml

Comments

  • robots.txt for www.newhaven.edu
  • test