5h.com
robots.txt
Robots Exclusion Standard data for 5h.com
Resource Scan
Scan Details
Site Domain | 5h.com |
Base Domain | 5h.com |
Scan Status | Ok |
Last Scan | 2024-10-03T11:51:32+00:00 |
Next Scan | 2024-11-02T11:51:32+00:00 |
Last Scan
Scanned | 2024-10-03T11:51:32+00:00 |
URL | http://5h.com/robots.txt |
Redirect | http://www.5h.com/robots.txt |
Redirect Domain | www.5h.com |
Redirect Base | 5h.com |
Domain IPs | 163.181.81.231, 163.181.81.232, 163.181.81.233, 163.181.81.234, 163.181.81.235, 163.181.81.236, 163.181.81.237, 163.181.81.238 |
Redirect IPs | 138.113.246.220, 168.235.202.16 |
Response IP | 168.235.202.16 |
Found | Yes |
Hash | 99c08820fe5223fa949024e8b10bfe4e8552f6520fbe167ab2ca4b62716fe411 |
SimHash | f854c6b341d7 |
Groups
*
Rule | Path |
---|---|
Disallow | /d/ |
Disallow | /e/class/ |
Disallow | /e/config/ |
Disallow | /e/data/ |
Disallow | /e/enews/ |
Disallow | /e/update/ |
Disallow | /ask/hd/* |
Disallow | /ku/*_*_*_*_*_*_*.html%26 |
Disallow | /e/action/ |
Disallow | /e/tags* |
Disallow | /e/search* |
Allow | / |