cae-groupe.com
robots.txt

Robots Exclusion Standard data for cae-groupe.com

Resource Scan

Scan Details

Site Domain cae-groupe.com
Base Domain cae-groupe.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2026-02-04T12:59:55+00:00
Next Scan 2026-05-05T12:59:55+00:00

Last Successful Scan

Scanned2025-06-16T03:41:51+00:00
URL https://cae-groupe.com/robots.txt
Redirect https://www.cae-groupe.com/robots.txt
Redirect Domain www.cae-groupe.com
Redirect Base cae-groupe.com
Domain IPs 104.21.83.6, 172.67.210.53, 2606:4700:3035::ac43:d235, 2606:4700:3036::6815:5306
Redirect IPs 104.21.83.6, 172.67.210.53, 2606:4700:3035::ac43:d235, 2606:4700:3036::6815:5306
Response IP 172.67.210.53
Found Yes
Hash dec7fa1ce06545d17fb26ed0d43259f06ea6f3f96ec035af2da8e2056ffc4416
SimHash 6764dd43a732

Groups

*

Rule Path
Disallow /index.php/
Disallow /*?
Disallow /checkout/
Disallow /app/
Disallow /lib/
Disallow /*.php$
Disallow /pkginfo/
Disallow /report/
Disallow /var/
Disallow /catalog/
Disallow /customer/
Disallow /sendfriend/
Disallow /review/
Disallow /*SID%3D
Disallow */pub/media/catalog/product/cache/*
Disallow */productalert/add/stock/product_id/*
Disallow *.html?*
Disallow *?q=*
Disallow */algoliasearch/*
Disallow */sendfriend/*
Allow /*.js
Allow /*.css
Allow /*.jpg
Allow /*.jpeg
Allow /*.png

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://www.cae-groupe.com/media/documents/sitemap.xml/sitemap.xml
sitemap https://www.cae-groupe.com/media/documents/sitemap.xml/sitemap_index.xml

Comments

  • Native
  • Custom
  • CSS, JS, Images
  • Sitemaps