thelancet.com
robots.txt
Robots Exclusion Standard data for thelancet.com
Resource Scan
Scan Details
Site Domain | thelancet.com |
Base Domain | thelancet.com |
Scan Status | Ok |
Last Scan | 2024-05-13T03:50:59+00:00 |
Next Scan | 2024-06-12T03:50:59+00:00 |
Last Scan
Scanned | 2024-05-13T03:50:59+00:00 |
URL | https://thelancet.com/robots.txt |
Redirect | https://www.thelancet.com/robots.txt |
Redirect Domain | www.thelancet.com |
Redirect Base | thelancet.com |
Domain IPs | 65.156.1.100 |
Redirect IPs | 104.18.123.114, 104.18.124.114 |
Response IP | 104.18.123.114 |
Found | Yes |
Hash | 86abd7ec1c2be0d507331a9a0ac39934ef24499f6621a8f937e19d76a8b56c37 |
SimHash | 72180860c7f3 |
Groups
*
Rule | Path |
---|---|
Disallow | /action |
Disallow | /help |
Disallow | /search |
Disallow | /feedback |
Disallow | /rss |
Disallow | /action/clickThrough |
Disallow | /action/showLogin |
Disallow | /page/account-confirmation-thanks |
Disallow | /media |
Disallow | /medical-research |
Disallow | /servlet/linkout |
Disallow | /na101/ |
Disallow | /na101v1/ |
Disallow | /na102/ |
Disallow | /doi/mlt/ |
Allow | /action/showJournal |
Allow | /action/showXml |
Allow | /series |
Allow | /isbn |
Allow | /doi/book |
Other Records
Field | Value |
---|---|
sitemap | https://www.thelancet.com/sitemap-index-1.txt |
sitemap | https://www.thelancet.com/custom_pages.gz |