woodartist.tw
robots.txt

Robots Exclusion Standard data for woodartist.tw

Resource Scan

Scan Details

Site Domain woodartist.tw
Base Domain woodartist.tw
Scan Status Ok
Last Scan5/15/2025, 8:19:29 AM
Next Scan 6/14/2025, 8:19:29 AM

Last Scan

Scanned5/15/2025, 8:19:29 AM
URL https://woodartist.tw/robots.txt
Redirect https://www.woodartist.tw/robots.txt
Redirect Domain www.woodartist.tw
Redirect Base woodartist.tw
Domain IPs 104.21.76.90, 172.67.191.156, 2606:4700:3032::6815:4c5a, 2606:4700:3037::ac43:bf9c
Redirect IPs 104.21.76.90, 172.67.191.156, 2606:4700:3032::6815:4c5a, 2606:4700:3037::ac43:bf9c
Response IP 172.67.191.156
Found Yes
Hash 47a029198ed645ca66b96725cc073a05ec08372ca94d49f98b75d2dd62938924
SimHash 4e149d727451

Groups

*

Rule Path
Disallow /a/
Disallow /account
Disallow /api
Disallow /apps/
Disallow /cart
Disallow /checkout
Disallow /community/
Disallow /orders
Disallow /payments
Disallow /search
Disallow /sf/cart
Disallow /sf/checkout
Disallow /tools/
Disallow /*preview_script_id*
Disallow /*preview_theme_id*
Disallow /apple-app-site-association

adsbot-google
googlebot
googlebot-image

Rule Path
Disallow /api
Disallow /cart
Disallow /checkout
Disallow /orders
Disallow /payments
Disallow /search
Disallow /sf/cart
Disallow /sf/checkout
Disallow /*preview_theme_id*
Disallow /*preview_script_id*

nutch

Rule Path
Disallow /

mj12bot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

blexbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

ahrefsbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

ahrefssiteaudit

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://www.woodartist.tw/sitemap.xml

Comments

  • Google adsbot ignores robots.txt unless specifically named!
  • Explicitly state Googlebot & Googlebot-Image to try Google Shopping