idxwi.thelandman.net
robots.txt
Robots Exclusion Standard data for idxwi.thelandman.net
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | idxwi.thelandman.net |
Base Domain | thelandman.net |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error. |
Last Scan | 2024-07-29T12:41:15+00:00 |
Next Scan | 2024-10-27T12:41:15+00:00 |
Last Successful Scan
Field | Value |
---|---|
Scanned | 2023-09-11T12:38:14+00:00 |
URL | https://idxwi.thelandman.net/robots.txt |
Domain IPs | 34.150.135.149 |
Response IP | 34.150.135.149 |
Found | Yes |
Hash | 161843e27b6961eff9d4e7dbf43a8e664669a3a2539b854224d5d2b0ee4020e9 |
SimHash | 463e4733ced8 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /api/ |
Disallow | /cli/ |
Disallow | /lts/ |
Disallow | /mgmt/ |
Disallow | /parentClasses/ |
Disallow | /scruffy/cli/ |
Disallow | /scruffy/logs/ |
Other Records
Field | Value |
---|---|
crawl-delay | 5 |
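
To illustrate how a crawler might interpret the `*` group and the crawl-delay record listed above, here is a minimal sketch using Python's standard `urllib.robotparser`. The `ROBOTS_TXT` text is a reconstruction from the rules in this report, not the original file (the original's exact text, including its 2 invalid lines, is not shown here), and the `ExampleBot` user agent, the `build_parser` helper, and the sample paths are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Assumption: reconstructed from the rules listed in this report.
# The original file's exact text (and its invalid lines) is not available.
ROBOTS_TXT = """\
User-agent: *
Disallow: /api/
Disallow: /cli/
Disallow: /lts/
Disallow: /mgmt/
Disallow: /parentClasses/
Disallow: /scruffy/cli/
Disallow: /scruffy/logs/
Crawl-delay: 5
"""

BASE = "https://idxwi.thelandman.net"


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text directly, without fetching anything."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# "ExampleBot" is a hypothetical user agent; it falls under the "*" group.
for path in ("/", "/api/listings", "/scruffy/", "/scruffy/logs/today"):
    allowed = parser.can_fetch("ExampleBot", BASE + path)
    print(f"{path}: {'allowed' if allowed else 'disallowed'}")

# Seconds to wait between requests, as declared by the crawl-delay record.
print("crawl delay:", parser.crawl_delay("ExampleBot"))
```

Matching here is by path prefix, so `/scruffy/` itself stays fetchable while `/scruffy/cli/` and `/scruffy/logs/` do not. Other parsers may treat the crawl-delay record differently or ignore it.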
Warnings
- 2 invalid lines.