www2.mrc-lmb.cam.ac.uk
robots.txt

Robots Exclusion Standard data for www2.mrc-lmb.cam.ac.uk

Resource Scan

Scan Details

Site Domain www2.mrc-lmb.cam.ac.uk
Base Domain cam.ac.uk
Scan Status Ok
Last Scan2025-07-24T10:37:35+00:00
Next Scan 2025-08-23T10:37:35+00:00

Last Scan

Scanned2025-07-24T10:37:35+00:00
URL https://www2.mrc-lmb.cam.ac.uk/robots.txt
Domain IPs 131.111.85.100
Response IP 131.111.85.100
Found Yes
Hash 7942036d34b2d030bb8dfc1eb2c15f1809c3e58f129730a55f76ea07ab4eb2c2
SimHash a128cd704adf

Groups

*

Rule Path
Disallow /Groups/media/
Disallow /Groups/media/oneill/
Disallow /administrator/
Disallow /cache/
Disallow /components/
Disallow /images/
Disallow /includes/
Disallow /installation/
Disallow /language/
Disallow /libraries/
Disallow /media/
Disallow /modules/
Disallow /plugins/
Disallow /templates/
Disallow /tmp/
Disallow /xmlrpc/
Disallow /internal/
Disallow /mymrclife/
Disallow /wordpress/wp-content/uploads/
Allow /wordpress/wp-content/uploads/*.jpg
Allow /wordpress/wp-content/uploads/*.gif
Allow /wordpress/wp-content/uploads/*.png
Allow /wordpress/wp-content/uploads/*.jpeg

Other Records

Field Value
sitemap https://www2.mrc-lmb.cam.ac.uk/sitemap.xml

Comments

  • User-agent: Screaming Frog SEO Spider
  • Allow: /internal/