gsc-tw.com
robots.txt

Robots Exclusion Standard data for gsc-tw.com

Resource Scan

Scanned	2025-06-06T09:24:02+00:00
URL	https://www.gsc-tw.com/robots.txt
Domain IPs	54.178.132.123
Response IP	54.178.132.123
Found	Yes
Hash	0febd5e9463d5471956ad01ef1af0b0e788a47bf844d808fbc4dbdb938470030
SimHash	a7950d0f6454

Rule

Path

Disallow

/admin/

Disallow

/user/sign_in

Disallow

/cart

Disallow

/account

Back to top

Field	Value
sitemap	https://www.gsc-tw.com/sitemap.xml
sitemap	https://www.gsc-tw.com/zh-TW/sitemap.xml
sitemap	https://www.gsc-tw.com/en/sitemap.xml

Field

Value

sitemap

https://www.gsc-tw.com/sitemap.xml

sitemap

https://www.gsc-tw.com/zh-TW/sitemap.xml

sitemap

https://www.gsc-tw.com/en/sitemap.xml

Back to top

shop global
See http://www.robotstxt.org/wc/norobots.html for documentation on how to use the robots.txt file
To ban all spiders from the entire site uncomment the next two lines:
User-Agent: *
Disallow: /

Back to top