intermarche.com
robots.txt

Robots Exclusion Standard data for intermarche.com

Resource Scan

Scan Details

Site Domain intermarche.com
Base Domain intermarche.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a server error.
Last Scan2026-01-07T07:54:51+00:00
Next Scan 2026-04-07T07:54:51+00:00

Last Successful Scan

Scanned2024-09-13T23:43:38+00:00
URL https://www.intermarche.com/robots.txt
Domain IPs 34.107.172.90
Response IP 34.107.172.90
Found Yes
Hash 33ca03ade3e8b2d778b0fb8e1a02f4aa43b73a2a51d99a1074eb28922d4e2cee
SimHash 54d4d04683f1

Groups

*

Rule Path
Allow /magasins/*/*/infos-pratiques
Disallow /accueil/drive-catalogue/*
Disallow /magasins/*
Disallow /rechercheproduits/*
Disallow /?pdvref*
Disallow /localisation/*
Disallow *trier*
Disallow *voir-tout*
Disallow /catalogues/*
Disallow /s/*
Disallow *?itemId=*
Disallow /recherche/*
Disallow /api/*
Disallow /catalog/*
Disallow /_next/*
Disallow *..png*