thelancet.com
robots.txt
Robots Exclusion Standard data for thelancet.com
Resource Scan
Scan Details
Site Domain | thelancet.com |
Base Domain | thelancet.com |
Scan Status | Ok |
Last Scan | 2024-10-10T13:07:39+00:00 |
Next Scan | 2024-11-09T13:07:39+00:00 |
Last Scan
Scanned | 2024-10-10T13:07:39+00:00 |
URL | https://thelancet.com/robots.txt |
Redirect | https://www.thelancet.com/robots.txt |
Redirect Domain | www.thelancet.com |
Redirect Base | thelancet.com |
Domain IPs | 65.156.1.100 |
Redirect IPs | 162.159.140.114, 172.66.0.112 |
Response IP | 162.159.140.114 |
Found | Yes |
Hash | 254c3a5f63e41457ba64ebc9e81167a5eedb2b3dc835fb100f2d5f09ac7fe8a8 |
SimHash | 6a180820c7f3 |
Groups
*
Rule | Path |
---|---|
Disallow | /action |
Disallow | /help |
Disallow | /search |
Disallow | /feedback |
Disallow | /rss |
Disallow | /action/clickThrough |
Disallow | /action/showLogin |
Disallow | /page/account-confirmation-thanks |
Disallow | /media |
Disallow | /medical-research |
Disallow | /servlet/linkout |
Disallow | /na101/ |
Disallow | /na101v1/ |
Disallow | /na102/ |
Disallow | /doi/mlt/ |
Allow | /action/showJournal |
Allow | /action/showXml |
Allow | /series |
Allow | /isbn |
Allow | /doi/book |
Allow | /.well-known/tdmrep.json |
Other Records
Field | Value |
---|---|
sitemap | https://www.thelancet.com/sitemap-index-1.txt |
sitemap | https://www.thelancet.com/custom_pages.gz |