thuysinh4u.com
robots.txt

Robots Exclusion Standard data for thuysinh4u.com

Resource Scan

Scan Details

Site Domain thuysinh4u.com
Base Domain thuysinh4u.com
Scan Status Ok
Last Scan2025-12-24T04:52:53+00:00
Next Scan 2026-01-23T04:52:53+00:00

Last Scan

Scanned2025-12-24T04:52:53+00:00
URL https://thuysinh4u.com/robots.txt
Domain IPs 103.7.6.28
Response IP 103.7.6.28
Found Yes
Hash dee7738a916167e751cf23d43896bbfd02ee8cf7bbd263f08395074646fa49b3
SimHash 67155e4ecdd0

Groups

*

Rule Path
Disallow /admin
Disallow /search
Disallow /checkout
Disallow /account
Disallow /collections/*%2B*
Disallow /collections/*%2B*
Disallow /collections/*%2B*
Disallow /blogs/*%2B*
Disallow /blogs/*%2B*
Disallow /blogs/*%2B*

nutch

Rule Path
Disallow /

mj12bot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

ahrefs

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://thuysinh4u.com/sitemap.xml

Comments

  • we use Sapo as our ecommerce platform