bumc.bu.edu
robots.txt

Robots Exclusion Standard data for bumc.bu.edu

Resource Scan

Scan Details

Site Domain bumc.bu.edu
Base Domain bu.edu
Scan Status Ok
Last Scan2024-05-10T01:59:28+00:00
Next Scan 2024-06-09T01:59:28+00:00

Last Scan

Scanned2024-05-10T01:59:28+00:00
URL http://bumc.bu.edu/robots.txt
Redirect https://www.bumc.bu.edu/robots.txt
Redirect Domain www.bumc.bu.edu
Redirect Base bu.edu
Domain IPs 128.197.236.79
Redirect IPs 18.173.121.110, 18.173.121.22, 18.173.121.23, 18.173.121.69
Response IP 18.165.171.125
Found Yes
Hash a7b0e14973fffc73b3885bac09cbb49b63ab8432bd380ae81e4fc24a0e691325
SimHash 4155aad3e664

Groups

*

Rule Path
Disallow /calendar
Disallow /cgi-bin
Disallow /wp-admin
Disallow /wp-includes
Disallow /wp-content/plugins
Disallow /wp-content/cache
Disallow /wp-content/themes
Disallow /trackback
Disallow /feed
Disallow /comments
Disallow /category/*/*
Disallow */trackback
Disallow */feed
Disallow */comments
Disallow /*?*
Disallow /*?
Allow /wp-content/uploads

googlebot-image

Rule Path
Disallow
Allow /*

mediapartners-google*

Rule Path
Disallow
Allow /*

Comments

  • Google Image
  • Google AdSense
  • Internet Archiver Wayback Machine
  • User-agent: ia_archiver
  • Disallow: /
  • digg mirror
  • User-agent: duggmirror
  • Disallow: /