cartucce.com
robots.txt

Robots Exclusion Standard data for cartucce.com

Resource Scan

Scan Details

Site Domain cartucce.com
Base Domain cartucce.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2026-02-27T22:17:38+00:00
Next Scan 2026-03-29T22:17:38+00:00

Last Successful Scan

Scanned2026-01-28T20:41:32+00:00
URL https://cartucce.com/robots.txt
Redirect https://www.cartucce.com/robots.txt
Redirect Domain www.cartucce.com
Redirect Base cartucce.com
Domain IPs 172.66.41.28, 172.66.42.228, 2606:4700:3108::ac42:291c, 2606:4700:3108::ac42:2ae4
Redirect IPs 172.66.41.28, 172.66.42.228, 2606:4700:3108::ac42:291c, 2606:4700:3108::ac42:2ae4
Response IP 172.66.41.28
Found Yes
Hash c3df57f1c98b8e3673dbf22072937303ed19064c84dc6e51eb44a0d7b48e479f
SimHash 574c6d44c652

Groups

*

Rule Path
Disallow /app/
Disallow /store_closed.html
Disallow /?dispatch=product_features.compare
Disallow /recupero-password.html
Disallow /aggiungi-profilo.html
Disallow /autenticazione.html
Disallow /*subcats%3D*
Disallow /*currency%3D*
Disallow /index.php?subcats=*
Disallow /*?sort_by=*
Disallow /*?items_per_page=*
Disallow /*?sl=en
Disallow /magazzino-di-ancona.html
Disallow /magazzino-di-pavia.html
Disallow /*?mobile=
Disallow /*?dispatch=attachments.getfile
Disallow /*?dispatch=product_features.add_product
Disallow /*dispatch%3Dproducts.search
Disallow /*q%3D

Other Records

Field Value
sitemap https://www.cartucce.com/sitemap.xml