twentythree.com.tw
robots.txt

Robots Exclusion Standard data for twentythree.com.tw

Archived Snapshots

Resource Scan

Scan Details

Site Domain	twentythree.com.tw
Base Domain	twentythree.com.tw
Scan Status	Ok
Last Scan	2024-11-16T08:35:02+00:00
Next Scan	2024-12-16T08:35:02+00:00

Last Scan

Scanned	2024-11-16T08:35:02+00:00
URL	https://www.twentythree.com.tw/robots.txt
Domain IPs	108.156.133.48, 108.156.133.53, 108.156.133.74, 108.156.133.89
Response IP	108.156.133.89
Found	Yes
Hash	fd5f4f0a6c59ce72c1203a5fce9aa9ce4ee7480ea2b7e18c2401ef15718a2c22
SimHash	295c1f11cdd6

Groups

mj12bot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

hubspot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

claudebot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

*

Rule	Path
Disallow	/closed
Disallow	/preview/
Disallow	/users/
Disallow	/orders
Disallow	/?debug=*
Disallow	/?theme_preview=*
Disallow	/?price_range_preview=*
Disallow	/?draft=*
Disallow	/api/
Disallow	/themes/
Disallow	/products?query=*

Rule

Path

Disallow

/closed

Disallow

/preview/

Disallow

/users/

Disallow

/orders

Disallow

/*?*debug=*

Disallow

/*?*theme_preview=*

Disallow

/*?*price_range_preview=*

Disallow

/*?*draft=*

Disallow

/api/

Disallow

/themes/

Disallow

/products*?*query=*

Back to top

Other Records

Field	Value
sitemap	https://www.twentythree.com.tw/sitemap.xml

Field

Value

sitemap

https://www.twentythree.com.tw/sitemap.xml

Back to top

Comments

robots.txt file for Shopline Merchant
split user-agent disallows for different bots, as not all bots may follow google's multi-user-agent standard
Allow crawling of all content except

Back to top

twentythree.com.twrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

mj12bot

hubspot

claudebot

*

Other Records

Comments

twentythree.com.tw
robots.txt