www.rotman.utoronto.ca
robots.txt

Robots Exclusion Standard data for www.rotman.utoronto.ca

Resource Scan

Scan Details

Site Domain www.rotman.utoronto.ca
Base Domain utoronto.ca
Scan Status Ok
Last Scan2025-07-02T01:40:18+00:00
Next Scan 2025-08-01T01:40:18+00:00

Last Scan

Scanned2025-07-02T01:40:18+00:00
URL https://www.rotman.utoronto.ca/robots.txt
Domain IPs 15.157.79.226, 3.97.145.103
Response IP 15.157.79.226
Found Yes
Hash c086d3ba609d58cdb379c48f0c1dc9f378a8a137759a43a33cffbde59f07c9a0
SimHash 6010dd536731

Groups

terminalfour nutch spider

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 0.5

searchstax crawler/1.0 (+https://www.searchstax.com)

Rule Path
Disallow /search/
Disallow /site-assets/
Disallow /component-library/

Other Records

Field Value
crawl-delay 0.5

googlebot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 1

googlebot-image

Rule Path
Allow /

Other Records

Field Value
crawl-delay 1

duckduckbot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 1

bingbot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 1

msnbot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 1

baiduspider

Rule Path
Allow /

Other Records

Field Value
crawl-delay 1

yeti

Rule Path
Allow /

Other Records

Field Value
crawl-delay 1

*

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.rotman.utoronto.ca/sitemap-en.xml