tusachnhoxinh.com
robots.txt
Robots Exclusion Standard data for tusachnhoxinh.com
Resource Scan
Scan Details
| Site Domain | tusachnhoxinh.com |
| Base Domain | tusachnhoxinh.com |
| Scan Status | Ok |
| Last Scan | 2025-11-05T18:05:15+00:00 |
| Next Scan | 2025-11-12T18:05:15+00:00 |
Last Scan
| Scanned | 2025-11-05T18:05:15+00:00 |
| URL | https://tusachnhoxinh.com/robots.txt |
| Domain IPs | 104.21.32.26, 172.67.182.86, 2606:4700:3031::ac43:b656, 2606:4700:3037::6815:201a |
| Response IP | 104.21.32.26 |
| Found | Yes |
| Hash | 43d1bb3bd3885779b5648ec3caf464c43deccda45ef662a833cef935ab220f44 |
| SimHash | 09009e208490 |
Groups
*
| Rule | Path |
|---|---|
| Disallow | */include/* |
| Disallow | */content/* |
| Disallow | */%26* |
| Disallow | *deb%3D* |
| Disallow | *?%3F* |
| Disallow | */uncat/* |
| Disallow | */tag/*/feed* |
| Disallow | *-key/feed* |
| Disallow | */print/* |
| Disallow | /*.ph* |
| Disallow | /*showcat* |
| Disallow | *ImageView.htm* |
| Disallow | *iframe.htm* |
| Disallow | *index.htm* |
| Disallow | *data%3Aimage* |
| Allow | / |
Other Records
| Field | Value |
|---|---|
| sitemap | https://tusachnhoxinh.com/sitemap.xml |