twdive.com
robots.txt

Robots Exclusion Standard data for twdive.com

Resource Scan

Scan Details

Site Domain twdive.com
Base Domain twdive.com
Scan Status Ok
Last Scan2025-10-12T07:55:34+00:00
Next Scan 2025-11-11T07:55:34+00:00

Last Scan

Scanned2025-10-12T07:55:34+00:00
URL https://twdive.com/robots.txt
Redirect https://www.twdive.com/robots.txt
Redirect Domain www.twdive.com
Redirect Base twdive.com
Domain IPs 104.21.57.107, 172.67.190.20, 2606:4700:3030::ac43:be14, 2606:4700:3035::6815:396b
Redirect IPs 104.21.57.107, 172.67.190.20, 2606:4700:3030::ac43:be14, 2606:4700:3035::6815:396b
Response IP 172.67.190.20
Found Yes
Hash c2d094a935e303156df8355870751de0cbae4b2384e4ce87598abd0b3b14993b
SimHash 04149d767451

Groups

*

Rule Path
Disallow /a/
Disallow /account
Disallow /api
Disallow /apps/
Disallow /cart
Disallow /checkout
Disallow /community/
Disallow /orders
Disallow /payments
Disallow /search
Disallow /sf/cart
Disallow /sf/checkout
Disallow /tools/
Disallow /*preview_script_id*
Disallow /*preview_theme_id*
Disallow /apple-app-site-association

adsbot-google
googlebot
googlebot-image

Rule Path
Disallow /api
Disallow /cart
Disallow /checkout
Disallow /orders
Disallow /payments
Disallow /search
Disallow /sf/cart
Disallow /sf/checkout
Disallow /*preview_theme_id*
Disallow /*preview_script_id*

nutch

Rule Path
Disallow /

mj12bot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

blexbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

ahrefsbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

ahrefssiteaudit

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://www.twdive.com/sitemap.xml

Comments

  • Google adsbot ignores robots.txt unless specifically named!
  • Explicitly state Googlebot & Googlebot-Image to try Google Shopping