dspace.mit.edu
robots.txt
Robots Exclusion Standard data for dspace.mit.edu
Resource Scan
Scan Details
Site Domain | dspace.mit.edu |
Base Domain | mit.edu |
Scan Status | Ok |
Last Scan | 2024-11-03T08:23:58+00:00 |
Next Scan | 2024-12-03T08:23:58+00:00 |
Last Scan
Scanned | 2024-11-03T08:23:58+00:00 |
URL | https://dspace.mit.edu/robots.txt |
Domain IPs | 34.201.211.163 |
Response IP | 34.201.211.163 |
Found | Yes |
Hash | 9ac0120153f9d2c57c74089c65cbf4cae02b272040668fad3eb5cb163fd76bd7 |
SimHash | a49cdf1545b5 |
Groups
*
Rule | Path |
---|---|
Disallow | /discover |
Disallow | /search-filter |
Disallow | /handle/*/*/discover |
Disallow | /handle/*/*/search-filter |
Disallow | /browse |
Disallow | /handle/*/*/browse |
Other Records
Field | Value |
---|---|
crawl-delay | 10 |
Other Records
Field | Value |
---|---|
sitemap | https://dspace.mit.edu/sitemap |
sitemap | https://dspace.mit.edu/htmlmap |
Comments