press.uchicago.edu
robots.txt

Robots Exclusion Standard data for press.uchicago.edu

Resource Scan

Scan Details

Site Domain press.uchicago.edu
Base Domain uchicago.edu
Scan Status Ok
Last Scan2024-11-03T07:50:30+00:00
Next Scan 2024-12-03T07:50:30+00:00

Last Scan

Scanned2024-11-03T07:50:30+00:00
URL https://press.uchicago.edu/robots.txt
Domain IPs 205.208.4.177
Response IP 205.208.4.177
Found Yes
Hash 4d315f9137f73ccdb04ee27f6d95a50f045bb5ac68c787e53dbf6e7137c4d539
SimHash 3d9818564c80

Groups

ccbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20

semrushbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 50

gsa-crawler

Rule Path
Disallow *

*

Rule Path
Disallow /books/compcopy*
Disallow /books/textadoption*
Disallow /books/downloadSubjectCsv*
Disallow /journals/e-readers/processor/*
Disallow *.ctl
Disallow *.epl

Other Records

Field Value
crawl-delay 2

Other Records

Field Value
sitemap https://press.uchicago.edu/docroot/sitemap.xml
sitemap https://press.uchicago.edu/docroot/authorsSitemap.xml
sitemap https://press.uchicago.edu/docroot/cBooksSitemap.xml
sitemap https://press.uchicago.edu/docroot/dBooksSitemap.xml