techx.com.my
robots.txt

Robots Exclusion Standard data for techx.com.my

Resource Scan

Scan Details

Site Domain techx.com.my
Base Domain techx.com.my
Scan Status Ok
Last Scan2025-03-16T05:50:53+00:00
Next Scan 2025-04-15T05:50:53+00:00

Last Scan

Scanned2025-03-16T05:50:53+00:00
URL https://techx.com.my/robots.txt
Domain IPs 151.101.130.236, 151.101.194.236, 151.101.2.236, 151.101.66.236
Response IP 151.101.2.236
Found Yes
Hash 80479bb21fab50a95c4cacb5f443a1985142e0e7f57757de57cd62362b4d8e12
SimHash 64149d767451

Groups

*

Rule Path
Disallow /a/
Disallow /account
Disallow /api
Disallow /apps/
Disallow /cart
Disallow /checkout
Disallow /community/
Disallow /orders
Disallow /payments
Disallow /search
Disallow /sf/cart
Disallow /sf/checkout
Disallow /tools/
Disallow /*preview_script_id*
Disallow /*preview_theme_id*
Disallow /apple-app-site-association

adsbot-google
googlebot
googlebot-image

Rule Path
Disallow /api
Disallow /cart
Disallow /checkout
Disallow /orders
Disallow /payments
Disallow /search
Disallow /sf/cart
Disallow /sf/checkout
Disallow /*preview_theme_id*
Disallow /*preview_script_id*

nutch

Rule Path
Disallow /

mj12bot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

blexbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

ahrefsbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

ahrefssiteaudit

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://techx.com.my/sitemap.xml

Comments

  • Google adsbot ignores robots.txt unless specifically named!
  • Explicitly state Googlebot & Googlebot-Image to try Google Shopping