doc-catho.la-croix.com
robots.txt

Robots Exclusion Standard data for doc-catho.la-croix.com

Resource Scan

Scan Details

Site Domain doc-catho.la-croix.com
Base Domain la-croix.com
Scan Status Ok
Last Scan 2024-05-08T05:33:59+00:00
Next Scan 2024-05-22T05:33:59+00:00

Last Scan

Scanned 2024-05-08T05:33:59+00:00
URL https://doc-catho.la-croix.com/robots.txt
Domain IPs 18.160.156.109, 18.160.156.123, 18.160.156.37, 18.160.156.4
Response IP 18.165.171.14
Found Yes
Hash 2b1aca5bda996c0d68ecef0c9af372c0f7f526866be21e1efbb11dd956be4e94
SimHash c05348836973

Groups

*

Rule Path
Disallow /JournalV2/
Disallow /amp/
Disallow /France/
Disallow /Monde/
Disallow /Religion/
Disallow /Economie/
Disallow /Culture/
Disallow /environnement/
Disallow /Famille/
Disallow /Sciences-et-ethique/
Disallow /Sport/
Disallow /Debats/
Disallow /Videos/
Disallow /print/
Disallow /Recherche/
Disallow /Dossiers/
Disallow /Dossiers-Dynamiques/
Disallow /Services/
Disallow /Actualite/
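
As a rough illustration of how a crawler might interpret the rules above, the following Python sketch rebuilds a robots.txt body from the listed paths and queries it with the standard library's `urllib.robotparser`. The rebuilt file is an assumption based on this report's rule table, not the verbatim file served by the site. The names `DISALLOWED`, `build_robots_txt`, `make_parser`, and the user agent `ExampleBot` are hypothetical and introduced only for this example.

```python
from urllib.robotparser import RobotFileParser

# Disallow paths for the "*" group, copied from the rule table above.
DISALLOWED = [
    "/JournalV2/", "/amp/", "/France/", "/Monde/", "/Religion/",
    "/Economie/", "/Culture/", "/environnement/", "/Famille/",
    "/Sciences-et-ethique/", "/Sport/", "/Debats/", "/Videos/",
    "/print/", "/Recherche/", "/Dossiers/", "/Dossiers-Dynamiques/",
    "/Services/", "/Actualite/",
]

BASE = "https://doc-catho.la-croix.com"


def build_robots_txt(paths):
    """Rebuild a robots.txt body (assumed layout) for the '*' group."""
    lines = ["User-agent: *"] + [f"Disallow: {p}" for p in paths]
    return "\n".join(lines) + "\n"


def make_parser(paths):
    """Return a RobotFileParser loaded with the rebuilt rules (no network access)."""
    parser = RobotFileParser()
    parser.parse(build_robots_txt(paths).splitlines())
    return parser


parser = make_parser(DISALLOWED)

# "ExampleBot" is a hypothetical user agent; it falls under the "*" group.
print(parser.can_fetch("ExampleBot", BASE + "/amp/some-article"))  # False
print(parser.can_fetch("ExampleBot", BASE + "/"))                  # True
```

This parser matches rules by path prefix and is case-sensitive, so under this sketch `/France/` is blocked while a lowercase `/france/` path would not be.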