dientulinhanh.com
robots.txt

Robots Exclusion Standard data for dientulinhanh.com

Resource Scan

Scan Details

Site Domain dientulinhanh.com
Base Domain dientulinhanh.com
Scan Status Ok
Last Scan2025-11-19T01:49:16+00:00
Next Scan 2025-12-19T01:49:16+00:00

Last Scan

Scanned2025-11-19T01:49:16+00:00
URL https://dientulinhanh.com/robots.txt
Domain IPs 104.21.80.231, 172.67.155.18, 2606:4700:3032::6815:50e7, 2606:4700:3036::ac43:9b12
Response IP 104.21.80.231
Found Yes
Hash 91dd6821e93ff12642e7a5f9ee1836bc6b15e9920451b73a14f2b19a5d74e858
SimHash a715dc4ecdd0

Groups

*

Rule Path
Disallow /admin
Disallow /search
Disallow /checkout
Disallow /account
Disallow /collections/*%2B*
Disallow /collections/*%2B*
Disallow /collections/*%2B*
Disallow /blogs/*%2B*
Disallow /blogs/*%2B*
Disallow /blogs/*%2B*

nutch

Rule Path
Disallow /

mj12bot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

ahrefs

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://dientulinhanh.com/sitemap.xml

Comments

  • we use Sapo as our ecommerce platform