idxwi.thelandman.net
robots.txt

Robots Exclusion Standard data for idxwi.thelandman.net

Resource Scan

Scan Details

Site Domain idxwi.thelandman.net
Base Domain thelandman.net
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-04-30T12:40:54+00:00
Next Scan 2024-07-29T12:40:54+00:00

Last Successful Scan

Scanned2023-09-11T12:38:14+00:00
URL https://idxwi.thelandman.net/robots.txt
Domain IPs 34.150.135.149
Response IP 34.150.135.149
Found Yes
Hash 161843e27b6961eff9d4e7dbf43a8e664669a3a2539b854224d5d2b0ee4020e9
SimHash 463e4733ced8

Groups

*

Rule Path
Disallow /api/
Disallow /cli/
Disallow /lts/
Disallow /mgmt/
Disallow /parentClasses/
Disallow /scruffy/cli/
Disallow /scruffy/logs/

Other Records

Field Value
crawl-delay 5

baiduspider

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

cazoodlebot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

twiceler

Rule Path
Disallow /

exabot

Rule Path
Disallow /

mlbot

Rule Path
Disallow /

iisbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

speedy

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

panscient.com

Rule Path
Disallow /

siteexplorer

Rule Path
Disallow /

searchmetricsbot

Rule Path
Disallow /

Warnings

  • 2 invalid lines.