truyenhuu.com
robots.txt

Robots Exclusion Standard data for truyenhuu.com

Resource Scan

Scan Details

Site Domain truyenhuu.com
Base Domain truyenhuu.com
Scan Status Ok
Last Scan2025-12-10T21:11:11+00:00
Next Scan 2026-01-09T21:11:11+00:00

Last Scan

Scanned2025-12-10T21:11:11+00:00
URL https://truyenhuu.com/robots.txt
Domain IPs 210.245.8.134
Response IP 210.245.8.134
Found Yes
Hash 8813481025e741e78b46fc6e639d7430bfc5a2e73ecf39c8d803e5ee0320387a
SimHash e7145c4ecdd0

Groups

*

Rule Path
Disallow /admin
Disallow /search
Disallow /checkout
Disallow /account
Disallow /collections/*%2B*
Disallow /collections/*%2B*
Disallow /collections/*%2B*
Disallow /blogs/*%2B*
Disallow /blogs/*%2B*
Disallow /blogs/*%2B*

nutch

Rule Path
Disallow /

mj12bot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

ahrefs

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://truyenhuu.com/sitemap.xml

Comments

  • we use Sapo as our ecommerce platform