sheffield.ac.uk
robots.txt

Robots Exclusion Standard data for sheffield.ac.uk

Resource Scan

Scan Details

Site Domain sheffield.ac.uk
Base Domain sheffield.ac.uk
Scan Status Ok
Last Scan2024-10-20T06:41:43+00:00
Next Scan 2024-11-19T06:41:43+00:00

Last Scan

Scanned2024-10-20T06:41:43+00:00
URL https://sheffield.ac.uk/robots.txt
Redirect https://www.sheffield.ac.uk/robots.txt
Redirect Domain www.sheffield.ac.uk
Redirect Base sheffield.ac.uk
Domain IPs 143.167.2.102
Redirect IPs 143.167.2.102
Response IP 143.167.2.102
Found Yes
Hash 6c37eb9d2c6b8935825a1a8865779eb6ce1fc600c9dd6b052fb8b15ae53b43f5
SimHash 2a0dd5530c12

Groups

*

Rule Path
Disallow /phone
Disallow /demo
Disallow /archive/
Disallow /branding
Disallow /homepage-beta
Disallow /nap
Disallow /content/
Disallow /polopoly_fs/1.428354%21/file/Alternative_Guide_Sheffield2014_15.pdf
Disallow /polopoly_fs/1.44653%21/file/AlternativeGuide.pdf
Disallow /diaryofevents
Disallow /careers-whats-on
Disallow /fee-deposits
Disallow /prospectus/buildProspectus.do
Disallow /prospectus/description.do
Disallow /prospectus/myProspectus.do
Disallow /prospectus/calcbursary.do
Disallow /framework/
Disallow /secure-bin/
Disallow /publications/export
Disallow /find?

baiduspider

Rule Path
Disallow /union/planner

zibber-v0.1(www.zibb.com/crawler/)

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.sheffield.ac.uk/sitemap.xml

Comments

  • robots.txt for http://www.shef.ac.uk/
  • NAP
  • CMS File store for P8
  • Search Appliance fixes
  • Java / CGI Apps
  • Googlebot obsessed with...
  • Search
  • Baidu hammering the union site
  • Site Maps