isb.edu
robots.txt

Robots Exclusion Standard data for isb.edu

Resource Scan

Scan Details

Site Domain isb.edu
Base Domain isb.edu
Scan Status Ok
Last Scan2024-09-30T07:58:08+00:00
Next Scan 2024-10-30T07:58:08+00:00

Last Scan

Scanned2024-09-30T07:58:08+00:00
URL https://www.isb.edu/robots.txt
Domain IPs 20.44.40.88
Response IP 20.44.40.88
Found Yes
Hash bfe4b5dd41729e724d7a9dd88db024ed1ed2f337eb6001a015a4221245c2790e
SimHash 4e0509c286b2

Groups

*

Rule Path
Disallow /content/dam/sites/isb/study-isb/advanced-management-programmes/amph/ISB_AMPH_Programme%20Brochure.pdf
Disallow /content/dam/sites/isb/study-isb/advanced-management-programmes/ampmo
Disallow /content/dam/sites/isb/study-isb/advanced-management-programmes/amppp
Disallow /content/dam/sites/isb/study-isb/advanced-management-programmes/ampi
Disallow /content/dam/sites/isb/study-isb/advanced-management-programmes/ampba/ampba_brochure
Disallow /content/dam/sites/isb/executive-education/files
Disallow /content/sites/isb/en/study-isb/advanced-management-programmes/amp
Disallow /content/sites/isb/en/executive-education/mp

yandex

Rule Path
Disallow /

moget
ichiro

Rule Path
Disallow /

baiduspider
baiduspider-video
baiduspider-image

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

googlebot

Rule Path
Disallow /content/dam/sites/isb/*.PDF$
Disallow /content/dam/sites/isb/*.pdf$
Disallow /content/dam/sites/isb/*.TXT$
Disallow /content/dam/sites/isb/*.txt$
Disallow /content/dam/sites/isb/*.DOC$
Disallow /content/dam/sites/isb/*.doc$
Disallow /content/dam/sites/isb/*.DOCX$
Disallow /content/dam/sites/isb/*.docx$
Disallow /content/dam/sites/isb/*.PPT$
Disallow /content/dam/sites/isb/*.ppt$
Disallow /content/dam/sites/isb/*.PPTX$
Disallow /content/dam/sites/isb/*.pptx$
Disallow /content/dam/sites/isb/*.XLS$
Disallow /content/dam/sites/isb/*.xls$
Disallow /content/dam/sites/isb/*.XLSX$
Disallow /content/dam/sites/isb/*.xlsx$
Disallow /content/dam/sites/isb/*.MHT$
Disallow /content/dam/sites/isb/*.mht$
Disallow /content/dam/sites/isb/*.ZIP$
Disallow /content/dam/sites/isb/*.zip$

*

Rule Path
Allow /
Disallow /*?*
Disallow /*?
Disallow /*.js$
Disallow /*.css$
Disallow /*.php$
Disallow /*?p=*&
Disallow /bin/
Disallow /etc/clientlibs/isb
Disallow /README.md

Other Records

Field Value
sitemap https://www.isb.edu/sitemap.xml

Comments

  • robots.txt for ISB
  • Marketing Pages and Brochures
  • Paths (no clean URLs)
  • Block servlet calls
  • Directories
  • Paths (clean URLs)
  • Sitemap