jbs.cam.ac.uk
robots.txt

Robots Exclusion Standard data for jbs.cam.ac.uk

Resource Scan

Scan Details

Site Domain jbs.cam.ac.uk
Base Domain cam.ac.uk
Scan Status Ok
Last Scan2024-04-02T15:50:53+00:00
Next Scan 2024-05-02T15:50:53+00:00

Last Scan

Scanned2024-04-02T15:50:53+00:00
URL https://jbs.cam.ac.uk/robots.txt
Redirect https://www.jbs.cam.ac.uk/robots.txt
Redirect Domain www.jbs.cam.ac.uk
Redirect Base cam.ac.uk
Domain IPs 131.111.150.22
Redirect IPs 141.193.213.20, 141.193.213.21
Response IP 141.193.213.21
Found Yes
Hash 735455e794d552a54dea5050fb4774346d5412d8de1bdf854f5e73702ad83760
SimHash a2201a296d96

Groups

*

Rule Path
Disallow /areas_types/
Disallow /brand/
Disallow /entrepreneurship/forms/
Disallow /execed/
Disallow /execed-master-page/
Disallow /executive-education/learn-more/
Disallow /faculty-research/centres/social-innovation/reach-ely/
Disallow /faculty-research/faculty-a-z/archived-profiles/
Disallow /mba-pre-arrival/
Disallow /parent-of-all-new-content/
Disallow /programmes/executive-mba/forms/
Disallow /programmes/master-of-finance-mfin/forms/
Disallow /programmes/mba/forms/
Disallow /programmes/prospectus-information/
Disallow /via/
Disallow /working-pages/
Disallow /*campaigns
Disallow /*forms/
Disallow /*reusable-elements

Other Records

Field Value
sitemap https://www.jbs.cam.ac.uk/sitemap_index.xml

Comments

  • robots.txt for www.jbs.cam.ac.uk
  • to discourage indexing of 'hidden' folders by all search engines