caeassistant.com
robots.txt

Robots Exclusion Standard data for caeassistant.com

Resource Scan

Scan Details

Site Domain caeassistant.com
Base Domain caeassistant.com
Scan Status Ok
Last Scan 2025-11-20T18:54:04+00:00
Next Scan 2025-12-20T18:54:04+00:00

Last Scan

Scanned 2025-11-20T18:54:04+00:00
URL https://caeassistant.com/robots.txt
Domain IPs 104.21.81.79, 172.67.140.227, 2606:4700:3030::6815:514f, 2606:4700:3034::ac43:8ce3
Response IP 104.21.81.79
Found Yes
Hash 85e8715beffc87fafd13680c628f98836e43c12abf640fd77b8e5a0f8bf56403
SimHash 817139688956
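
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest. The report does not name the algorithm or say which bytes were hashed, so the sketch below only assumes SHA-256 over the raw response body. The function name body_matches_reported_hash is hypothetical and used here for illustration.

```python
import hashlib

# Hash value reported in the scan details above.
REPORTED_HASH = "85e8715beffc87fafd13680c628f98836e43c12abf640fd77b8e5a0f8bf56403"


def body_matches_reported_hash(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """Compare the SHA-256 hex digest of body with a reported hash.

    Assumption: the scanner hashes the raw robots.txt response body with
    SHA-256. The report itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest() == reported.lower()
```

Passing the bytes of a freshly fetched robots.txt would then indicate whether the file has changed since the scan, provided the assumption about the algorithm holds.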

Groups

*

Rule Path
Disallow /store
Disallow /store/
Disallow */feed/
Disallow /my-account/
Disallow /wp-admin/
Disallow /bin/
Disallow /wp-json/
Disallow /vendor/*

gptbot

No rules defined. All paths allowed.
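
As an illustration of how the two groups above apply, the following is a minimal sketch of path matching with the "*" wildcard and the "$" end anchor. It is not a full robots.txt parser: it handles only the Disallow rules listed in this report, and it selects a group by exact, case-insensitive user-agent name. The names GROUPS, pattern_to_regex and is_allowed are hypothetical.

```python
import re

# Disallow rules listed for the "*" group in the scan above.
DISALLOW_ALL_AGENTS = [
    "/store",
    "/store/",
    "*/feed/",
    "/my-account/",
    "/wp-admin/",
    "/bin/",
    "/wp-json/",
    "/vendor/*",
]

# Per the scan, the "gptbot" group defines no rules.
GROUPS = {"*": DISALLOW_ALL_AGENTS, "gptbot": []}


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex.

    "*" matches any run of characters; a trailing "$" anchors the end.
    """
    anchored_end = pattern.endswith("$")
    if anchored_end:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored_end else ""))


def is_allowed(user_agent: str, path: str) -> bool:
    """Return True if no Disallow rule in the agent's group matches path."""
    # Simplification: a crawler uses its own group when one exists by exact
    # name, otherwise it falls back to the "*" group.
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    # re.Pattern.match anchors at the start of the path, as rules do.
    return not any(pattern_to_regex(rule).match(path) for rule in rules)
```

Under these assumptions, is_allowed("gptbot", "/store/") is true because the gptbot group is empty, while is_allowed("SomeBot", "/blog/feed/") is false because the "*/feed/" rule in the "*" group matches.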

Other Records

Field Value
sitemap https://caeassistant.com/sitemap_index.xml

Comments

  • General rules for all user agents
  • Disallow: /shop/*
  • Disallow: /*add-to-cart=*
  • Disallow: /cart/
  • Disallow: /checkout/
  • Disallow: /search/
  • Disallow: /shop/
  • Disallow: /*?add-to-cart=
  • Disallow: /*?orderby=
  • Disallow: /*?per_page=
  • Disallow: /*?utm_
  • Disallow: /search/*
  • Disallow: *?s*
  • Disallow: *?*
  • Disallow: *utm_*
  • Disallow: /search?q=*
  • Disallow: /shop/?*
  • Disallow: /cart/
  • Disallow: /*add-to-cart=
  • Sitemap link