doc-catho.la-croix.com
robots.txt
Robots Exclusion Standard data for doc-catho.la-croix.com
Resource Scan
Scan Details
Site Domain | doc-catho.la-croix.com |
Base Domain | la-croix.com |
Scan Status | Ok |
Last Scan | 2024-05-08T05:33:59+00:00 |
Next Scan | 2024-05-22T05:33:59+00:00 |
Last Scan
Scanned | 2024-05-08T05:33:59+00:00 |
URL | https://doc-catho.la-croix.com/robots.txt |
Domain IPs | 18.160.156.109, 18.160.156.123, 18.160.156.37, 18.160.156.4 |
Response IP | 18.165.171.14 |
Found | Yes |
Hash | 2b1aca5bda996c0d68ecef0c9af372c0f7f526866be21e1efbb11dd956be4e94 |
SimHash | c05348836973 |
Groups
*
Rule | Path |
---|---|
Disallow | /JournalV2/ |
Disallow | /amp/ |
Disallow | /France/ |
Disallow | /Monde/ |
Disallow | /Religion/ |
Disallow | /Economie/ |
Disallow | /Culture/ |
Disallow | /environnement/ |
Disallow | /Famille/ |
Disallow | /Sciences-et-ethique/ |
Disallow | /Sport/ |
Disallow | /Debats/ |
Disallow | /Videos/ |
Disallow | /print/ |
Disallow | /Recherche/ |
Disallow | /Dossiers/ |
Disallow | /Dossiers-Dynamiques/ |
Disallow | /Services/ |
Disallow | /Actualite/ |