nettruyenbe.com
robots.txt
Robots Exclusion Standard data for nettruyenbe.com
Resource Scan
Scan Details
Site Domain | nettruyenbe.com |
Base Domain | nettruyenbe.com |
Scan Status | Ok |
Last Scan | 2024-06-24T06:15:17+00:00 |
Next Scan | 2024-07-01T06:15:17+00:00 |
Last Scan
Scanned | 2024-06-24T06:15:17+00:00 |
URL | https://nettruyenbe.com/robots.txt |
Domain IPs | 104.21.11.177, 172.67.149.209, 2606:4700:3031::6815:bb1, 2606:4700:3035::ac43:95d1 |
Response IP | 172.67.149.209 |
Found | Yes |
Hash | 0176b56ee1520431aaa5b236e9270c78f20fe14b1ed9ccdb25b95614315c4129 |
SimHash | 09009e209410 |
Groups
*
Rule | Path |
---|---|
Disallow | */include/* |
Disallow | */content/* |
Disallow | */%26* |
Disallow | *deb%3D* |
Disallow | *?%3F* |
Disallow | */uncat/* |
Disallow | */tag/*/feed* |
Disallow | *-key/feed* |
Disallow | */print/* |
Disallow | /*.ph* |
Disallow | /*showcat* |
Disallow | *ImageView.htm* |
Disallow | *iframe.htm* |
Disallow | *index.htm* |
Disallow | *data%3Aimage* |
Allow | / |
Other Records
Field | Value |
---|---|
sitemap | https://nettruyenbe.com/sitemap.xml |