entreprises.lefigaro.fr
robots.txt
Robots Exclusion Standard data for entreprises.lefigaro.fr
Resource Scan
Scan Details
Site Domain | entreprises.lefigaro.fr |
Base Domain | lefigaro.fr |
Scan Status | Ok |
Last Scan | 2024-04-14T18:03:53+00:00 |
Next Scan | 2024-05-14T18:03:53+00:00 |
Last Scan
Scanned | 2024-04-14T18:03:53+00:00 |
URL | https://entreprises.lefigaro.fr/robots.txt |
Domain IPs | 104.69.44.107 |
Response IP | 23.41.75.163 |
Found | Yes |
Hash | 7ddd214d614c00362b344348df5cf8a6f921dde77540509219166e54184a6e71 |
SimHash | a14c4910eb90 |
Groups
* (all user agents)
Rule | Path |
---|---|
Disallow | /recherche$ |
Disallow | /recherche? |
Disallow | /*/entreprise-*/veille |
Disallow | /syntheses-actu/ |
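
The four Disallow paths above use the two special characters that most crawlers recognise: `*` stands for any run of characters, and a trailing `$` pins the match to the end of the URL path. The following is a minimal sketch of that matching logic, not the implementation of any particular crawler. The names `rule_to_regex` and `is_disallowed` and the sample paths are hypothetical and chosen only for illustration. Because this group has no Allow rules, the sketch does not need longest-match precedence.

```python
import re

# Disallow paths copied from the "*" group in the table above.
DISALLOW_RULES = [
    "/recherche$",
    "/recherche?",
    "/*/entreprise-*/veille",
    "/syntheses-actu/",
]


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate one robots.txt path pattern into a compiled regex.

    '*' matches any sequence of characters; a trailing '$' anchors the
    pattern to the end of the path. Everything else is literal, so the
    '?' in '/recherche?' is a plain question mark, not a wildcard.
    """
    anchored = rule.endswith("$")
    body = rule[:-1] if anchored else rule
    pattern = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(pattern + (r"\Z" if anchored else ""))


def is_disallowed(path: str, rules=DISALLOW_RULES) -> bool:
    """Return True if any rule matches from the start of the path."""
    # re.match anchors at the beginning, which mirrors prefix matching.
    return any(rule_to_regex(rule).match(path) for rule in rules)


if __name__ == "__main__":
    # Hypothetical paths, not taken from the scanned site.
    for sample in [
        "/recherche",
        "/recherche?q=acme",
        "/recherche/avancee",
        "/fr/entreprise-acme/veille",
        "/syntheses-actu/2024",
        "/syntheses-actu",
    ]:
        print(sample, is_disallowed(sample))
```

Read this way, the first two rules block the bare search page and any search URL with a query string while leaving deeper paths under `/recherche/` crawlable, and the last rule applies only to paths that include the trailing slash.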
Other Records
Field | Value |
---|---|
sitemap | https://entreprises.lefigaro.fr/sitemap/ |
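
As a rough illustration of how the sitemap record can be read programmatically, the sketch below feeds a reconstructed robots.txt to Python's standard `urllib.robotparser`. The `ROBOTS_TXT` text is an assumption rebuilt from the tables in this report; the real file may differ in ordering, casing, or comments (its hash would tell). The parser is used here only to extract the Sitemap record, since it is not designed around the `*` and `$` pattern characters.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the file from the tables in this report;
# the actual bytes served by the site are not reproduced here.
ROBOTS_TXT = """\
User-agent: *
Disallow: /recherche$
Disallow: /recherche?
Disallow: /*/entreprise-*/veille
Disallow: /syntheses-actu/

Sitemap: https://entreprises.lefigaro.fr/sitemap/
"""

parser = RobotFileParser()
# parse() accepts an iterable of lines, so no network request is made.
parser.parse(ROBOTS_TXT.splitlines())

# site_maps() returns None when no Sitemap record exists (Python 3.8+).
sitemaps = parser.site_maps() or []

if __name__ == "__main__":
    for url in sitemaps:
        print(url)
```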