lloydslist.com
robots.txt

Robots Exclusion Standard data for lloydslist.com

Resource Scan

Scan Details

Site Domain lloydslist.com
Base Domain lloydslist.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-09-29T06:05:10+00:00
Next Scan 2024-10-29T06:05:10+00:00

Last Successful Scan

Scanned2024-08-08T06:04:05+00:00
URL https://www.lloydslist.com/robots.txt
Domain IPs 18.202.75.137
Response IP 18.202.75.137
Found Yes
Hash 8f5b4ec8d41e85a3a38aa9b86f6d63013abb93ec1a614a894a86b4f011abb761
SimHash f44bd882d764

Groups

*

Rule Path
Disallow /templates/one
Disallow /templates/two
Disallow /one-hundred-container-ports-2018/data
Disallow /one-hundred-container-ports-2019/data
Disallow /request-subscription-thank-you-page
Disallow /web-briefing
Disallow /one-hundred-edition-ten
Disallow /tour
Disallow /stage
Disallow /divya-testing
Disallow /web-briefing-three
Disallow /one-hundred-edition-nine
Disallow /email-sign-ups
Disallow /editorial-board
Disallow /one-hundred-edition-eleven
Disallow /simon-testing
Disallow /simon-test-page
Disallow /test-one-hundred-container-ports-2022
Disallow /test-one-hundred-edition-thirteen

mj12bot

Rule Path
Disallow /

*

Rule Path
Disallow /rss/

*

Rule Path
Disallow /data-tools/

*

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

twitterbot

Rule Path
Disallow