ghientruyen.org
robots.txt
Robots Exclusion Standard data for ghientruyen.org
Resource Scan
Scan Details
| Field | Value |
|---|---|
| Site Domain | ghientruyen.org |
| Base Domain | ghientruyen.org |
| Scan Status | Ok |
| Last Scan | 2025-11-24T01:13:21+00:00 |
| Next Scan | 2025-12-24T01:13:21+00:00 |
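The two timestamps above are ISO 8601 values in UTC, and the gap between them can be checked directly. The sketch below only measures that gap. Whether the scanner schedules on a fixed 30-day interval or on a calendar month is not stated in the report, so treat any scheduling rule inferred from it as an assumption.

```python
from datetime import datetime

# Timestamps copied from the Scan Details table.
last_scan = datetime.fromisoformat("2025-11-24T01:13:21+00:00")
next_scan = datetime.fromisoformat("2025-12-24T01:13:21+00:00")

# Subtracting two timezone-aware datetimes yields a timedelta.
interval = next_scan - last_scan
print(f"Days between scans: {interval.days}")
```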
Last Scan
| Field | Value |
|---|---|
| Scanned | 2025-11-24T01:13:21+00:00 |
| URL | https://ghientruyen.org/robots.txt |
| Domain IPs | 104.21.60.50, 172.67.191.166, 2606:4700:3033::6815:3c32, 2606:4700:3036::ac43:bfa6 |
| Response IP | 172.67.191.166 |
| Found | Yes |
| Hash | a411c11c2506424121379a0723a5de5570cfb139f1caba7410d42d94e174b521 |
| SimHash | e90888e2e192 |
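The Hash value is 64 hexadecimal characters long, which is consistent with a SHA-256 digest. The report does not say which algorithm or which bytes (raw response body, normalized text, and so on) the scanner hashes, so the following is a minimal sketch under the assumption that it is SHA-256 over the raw body. The helper names `robots_digest` and `matches_report` are hypothetical.

```python
import hashlib

# Hash value copied from the Last Scan table.
REPORTED_HASH = "a411c11c2506424121379a0723a5de5570cfb139f1caba7410d42d94e174b521"

def robots_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()

def matches_report(body: bytes) -> bool:
    """Check a fetched body against the reported hash, under the assumption above."""
    return robots_digest(body) == REPORTED_HASH
```

A mismatch would not necessarily mean the file changed. It could equally mean the scanner hashes something other than the raw bytes.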
Groups
User-agent: *
| Rule | Path |
|---|---|
| Disallow | /public/ |
| Disallow | /tim-kiem?keyword=* |
| Disallow | /truyen-tranh/tim-kiem?keyword=* |
| Disallow | /trang-ca-nhan |
| Disallow | /*?sort=asc$ |
| Disallow | /*?sort=desc$ |
| Disallow | /truyen-tranh?page=* |
| Disallow | /*?page=* |
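Several of the paths above use the `*` wildcard and the `$` end-of-URL anchor. The following is a minimal sketch of how a crawler might evaluate them, assuming the common interpretation where `*` matches any run of characters, `$` anchors the end of the URL, and rules without `$` are prefix matches. The function names `pattern_to_regex` and `is_disallowed` are hypothetical, and because this group has no Allow rules, the sketch skips longest-match precedence entirely.

```python
import re

# Disallow patterns copied from the "*" group in the table above.
DISALLOW = [
    "/public/",
    "/tim-kiem?keyword=*",
    "/truyen-tranh/tim-kiem?keyword=*",
    "/trang-ca-nhan",
    "/*?sort=asc$",
    "/*?sort=desc$",
    "/truyen-tranh?page=*",
    "/*?page=*",
]

def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate one robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape each literal piece, then join the pieces with '.*' for each '*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))

def is_disallowed(path: str) -> bool:
    """Return True if the path (including query string) matches any rule."""
    return any(pattern_to_regex(p).match(path) for p in DISALLOW)

for candidate in ["/public/css/app.css", "/truyen-tranh?sort=asc",
                  "/truyen-tranh?sort=asc&page=2", "/truyen-tranh/one-piece"]:
    print(candidate, is_disallowed(candidate))
```

Under this interpretation, `/truyen-tranh?sort=asc&page=2` is not blocked: the `$` anchor stops the sort rules from matching, and the page rules require the literal `?page=` rather than `&page=`.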
Other Records
| Field | Value |
|---|---|
| sitemap | https://ghientruyen.org/sitemap.xml |
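The sitemap record can be read with Python's standard `urllib.robotparser`. The sketch below feeds the parser an abbreviated, reconstructed excerpt rather than the live file. The exact formatting of the original robots.txt is not shown in this report, so `ROBOTS_LINES` is an assumption for illustration only.

```python
from urllib.robotparser import RobotFileParser

# Abbreviated reconstruction from the tables above (not the verbatim file).
ROBOTS_LINES = [
    "User-agent: *",
    "Disallow: /public/",
    "Sitemap: https://ghientruyen.org/sitemap.xml",
]

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)

# site_maps() (Python 3.8+) returns a list of sitemap URLs, or None if absent.
sitemaps = parser.site_maps()
print(sitemaps)
```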