intermarche.com
robots.txt
Robots Exclusion Standard data for intermarche.com
Resource Scan
Scan Details
| Site Domain | intermarche.com |
| Base Domain | intermarche.com |
| Scan Status | Failed |
| Failure Stage | Fetching resource. |
| Failure Reason | Server returned a server error. |
| Last Scan | 2026-01-07T07:54:51+00:00 |
| Next Scan | 2026-04-07T07:54:51+00:00 |
Last Successful Scan
| Scanned | 2024-09-13T23:43:38+00:00 |
| URL | https://www.intermarche.com/robots.txt |
| Domain IPs | 34.107.172.90 |
| Response IP | 34.107.172.90 |
| Found | Yes |
| Hash | 33ca03ade3e8b2d778b0fb8e1a02f4aa43b73a2a51d99a1074eb28922d4e2cee |
| SimHash | 54d4d04683f1 |
Groups
*
| Rule | Path |
|---|---|
| Allow | /magasins/*/*/infos-pratiques |
| Disallow | /accueil/drive-catalogue/* |
| Disallow | /magasins/* |
| Disallow | /rechercheproduits/* |
| Disallow | /?pdvref* |
| Disallow | /localisation/* |
| Disallow | *trier* |
| Disallow | *voir-tout* |
| Disallow | /catalogues/* |
| Disallow | /s/* |
| Disallow | *?itemId=* |
| Disallow | /recherche/* |
| Disallow | /api/* |
| Disallow | /catalog/* |
| Disallow | /_next/* |
| Disallow | *..png* |