press.uchicago.edu
robots.txt
Robots Exclusion Standard data for press.uchicago.edu
Resource Scan
Scan Details
Site Domain | press.uchicago.edu |
Base Domain | uchicago.edu |
Scan Status | Ok |
Last Scan | 2024-11-03T07:50:30+00:00 |
Next Scan | 2024-12-03T07:50:30+00:00 |
Last Scan
Scanned | 2024-11-03T07:50:30+00:00 |
URL | https://press.uchicago.edu/robots.txt |
Domain IPs | 205.208.4.177 |
Response IP | 205.208.4.177 |
Found | Yes |
Hash | 4d315f9137f73ccdb04ee27f6d95a50f045bb5ac68c787e53dbf6e7137c4d539 |
SimHash | 3d9818564c80 |
Groups
*
Rule | Path |
---|---|
Disallow | /books/compcopy* |
Disallow | /books/textadoption* |
Disallow | /books/downloadSubjectCsv* |
Disallow | /journals/e-readers/processor/* |
Disallow | *.ctl |
Disallow | *.epl |
Other Records
Field | Value |
---|---|
crawl-delay | 2 |
Other Records
Field | Value |
---|---|
sitemap | https://press.uchicago.edu/docroot/sitemap.xml |
sitemap | https://press.uchicago.edu/docroot/authorsSitemap.xml |
sitemap | https://press.uchicago.edu/docroot/cBooksSitemap.xml |
sitemap | https://press.uchicago.edu/docroot/dBooksSitemap.xml |