thcb.in
robots.txt

Robots Exclusion Standard data for thcb.in

Resource Scan

Scan Details

Site Domain thcb.in
Base Domain thcb.in
Scan Status Ok
Last Scan 2026-01-15T17:21:22+00:00
Next Scan 2026-01-22T17:21:22+00:00

Last Scan

Scanned 2026-01-15T17:21:22+00:00
URL https://thcb.in/robots.txt
Domain IPs 104.21.85.152, 172.67.207.78, 2606:4700:3034::ac43:cf4e, 2606:4700:3036::6815:5598
Response IP 104.21.85.152
Found Yes
Hash a25978fb2e33015aa8960f5993e17d4a37ec5b215cd0d5a580e878125e628344
SimHash e900a822cfb2

Groups

*

Rule Path
Disallow /wp-content/uploads/wc-logs/
Disallow /wp-content/uploads/woocommerce_transient_files/
Disallow /wp-content/uploads/woocommerce_uploads/
Disallow /*?add-to-cart=
Disallow /*?*add-to-cart=
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
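
How the rules in this group interact can be sketched in a few lines of Python. The example below is a minimal illustration, not the scanner's own logic: it assumes the longest-match precedence described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie) and treats `*` as "any run of characters". The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical, as are the sample paths in the usage note. A hand-rolled matcher is used so that the wildcard and precedence behaviour is explicit.

```python
import re

# Rules copied from the "*" group listed above, in the same order.
RULES = [
    ("disallow", "/wp-content/uploads/wc-logs/"),
    ("disallow", "/wp-content/uploads/woocommerce_transient_files/"),
    ("disallow", "/wp-content/uploads/woocommerce_uploads/"),
    ("disallow", "/*?add-to-cart="),
    ("disallow", "/*?*add-to-cart="),
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    "*" matches any run of characters; a trailing "$" anchors the end.
    Everything else is matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` (path plus optional query string) may be crawled."""
    best_len = -1
    allowed = True  # assumption: no matching rule means the path is allowed
    for kind, pattern in rules:
        # re.match anchors at the start, so patterns behave as prefixes.
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            # Longest pattern wins; on equal length, Allow takes precedence.
            if length > best_len or (length == best_len and kind == "allow"):
                best_len = length
                allowed = kind == "allow"
    return allowed
```

Under these assumptions, `is_allowed("/wp-admin/")` is false while `is_allowed("/wp-admin/admin-ajax.php")` is true, because the Allow pattern is longer than the Disallow pattern that also matches. A cart link such as `/shop/?orderby=price&add-to-cart=42` is blocked by the `/*?*add-to-cart=` rule, not by `/*?add-to-cart=`, since the parameter does not directly follow the `?`.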

Other Records

Field Value
sitemap https://thcb.in/sitemap_index.xml