sloanreview.mit.edu
robots.txt

Robots Exclusion Standard data for sloanreview.mit.edu

Resource Scan

Scan Details

Site Domain sloanreview.mit.edu
Base Domain mit.edu
Scan Status Ok
Last Scan2024-04-01T15:53:07+00:00
Next Scan 2024-05-01T15:53:07+00:00

Last Scan

Scanned2024-04-01T15:53:07+00:00
URL https://sloanreview.mit.edu/robots.txt
Domain IPs 141.193.213.20, 141.193.213.21
Response IP 141.193.213.20
Found Yes
Hash e09df31ed88c63dc0273e066367f2f6f707f49c1375ff8b07c1145c64f4135d5
SimHash 6104d8136793

Groups

scrapy

Rule Path
Allow /

*

Rule Path
Disallow /wp-admin/
Disallow /*.pdf$
Disallow /*.epub$
Disallow /media-download/
Disallow /wikis/magazine-articles/
Disallow /wikis/
Allow /wp-admin/admin-ajax.php

Other Records

Field Value
crawl-delay 10

gptbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://sloanreview.mit.edu/sitemap.xml