houcing.cf
robots.txt
Robots Exclusion Standard data for houcing.cf
Resource Scan
Scan Details
Site Domain | houcing.cf |
Base Domain | houcing.cf |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Couldn't connect to server. |
Last Scan | 2024-09-05T10:47:57+00:00 |
Next Scan | 2024-10-05T10:47:57+00:00 |
Last Successful Scan
Scanned | 2024-07-15T10:47:14+00:00 |
URL | https://houcing.cf/robots.txt |
Domain IPs | 104.21.45.83, 172.67.212.67, 2606:4700:3031::ac43:d443, 2606:4700:3037::6815:2d53 |
Response IP | 172.67.212.67 |
Found | Yes |
Hash | c15c89304193d69edc667eca4a40d5e51a0a1dd3fcc9a42a05e5bbabe94fff76 |
SimHash | dc5655566312 |
Groups
easouspider
Rule | Path |
---|---|
Disallow | /js/ |
Disallow | /css/ |
Disallow | /ck/ |
Disallow | /dist/ |
Other Records
Field | Value |
---|---|
crawl-delay | 10 |
msnbot
Rule | Path |
---|---|
Disallow | /js/ |
Disallow | /css/ |
Disallow | /ck/ |
Disallow | /dist/ |
Other Records
Field | Value |
---|---|
crawl-delay | 10 |
petalbot
Rule | Path |
---|---|
Disallow | /js/ |
Disallow | /css/ |
Disallow | /ck/ |
Disallow | /dist/ |
Other Records
Field | Value |
---|---|
crawl-delay | 10 |
*
Rule | Path |
---|---|
Disallow | / |
Warnings
- 5 invalid lines.
- `request-rate` is not a known field.