ghientruyen.org
robots.txt

Robots Exclusion Standard data for ghientruyen.org

Resource Scan

Scan Details

Site Domain ghientruyen.org
Base Domain ghientruyen.org
Scan Status Ok
Last Scan2025-11-24T01:13:21+00:00
Next Scan 2025-12-24T01:13:21+00:00

Last Scan

Scanned2025-11-24T01:13:21+00:00
URL https://ghientruyen.org/robots.txt
Domain IPs 104.21.60.50, 172.67.191.166, 2606:4700:3033::6815:3c32, 2606:4700:3036::ac43:bfa6
Response IP 172.67.191.166
Found Yes
Hash a411c11c2506424121379a0723a5de5570cfb139f1caba7410d42d94e174b521
SimHash e90888e2e192

Groups

*

Rule Path
Disallow /public/
Disallow /tim-kiem?keyword=*
Disallow /truyen-tranh/tim-kiem?keyword=*
Disallow /trang-ca-nhan
Disallow /*?sort=asc$
Disallow /*?sort=desc$
Disallow /truyen-tranh?page=*
Disallow /*?page=*

Other Records

Field Value
sitemap https://ghientruyen.org/sitemap.xml