Robots Exclusion Standard data for inicea.fr
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | inicea.fr |
Base Domain | inicea.fr |
Scan Status | Ok |
Last Scan | 2024-04-28T05:40:01+00:00 |
Next Scan | 2024-05-28T05:40:01+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-04-28T05:40:01+00:00 |
URL | https://inicea.fr/robots.txt |
Redirect | https://www.inicea.fr:443/robots.txt |
Redirect Domain | www.inicea.fr |
Redirect Base | inicea.fr |
Domain IPs | 15.236.183.156 |
Redirect IPs | 204.246.191.123, 204.246.191.3, 204.246.191.51, 204.246.191.86 |
Response IP | 18.165.171.21 |
Found | Yes |
Hash | 15ab9705a7ed91048641dace6e04a591bd98c73e3fd9b1eed4bfa5b0c397eb1c |
SimHash | 8d1484704991 |
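
The Hash value is 64 hexadecimal digits long, which is consistent with a SHA-256 digest of the fetched file, although this page does not name the algorithm. A minimal sketch of computing such a fingerprint, assuming SHA-256 over the raw response body; the `fingerprint` helper and the sample body are hypothetical and are not the actual inicea.fr file:

```python
import hashlib


def fingerprint(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()


# Hypothetical sample body, used only to show the shape of the result.
sample = b"User-agent: *\nDisallow: /medias*\n"
digest = fingerprint(sample)
print(len(digest), digest)
```

Comparing the digest of a fresh fetch with the stored value is one way a scanner could detect that the file changed between scans.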
Groups
User-agent: *
Rule | Path |
---|---|
Allow | /.css |
Allow | /.js |
Disallow | /resultat-recherche* |
Disallow | /articles?* |
Disallow | *utm |
Disallow | /medias* |
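
To see how these rules apply to concrete paths, the following is a minimal sketch of a matcher. It assumes the matching semantics described in RFC 9309: `*` matches any run of characters, a trailing `$` anchors the end of the path, the longest matching pattern wins, and Allow wins a tie. Individual crawlers may interpret the file differently. The names `RULES`, `pattern_to_regex`, and `is_allowed` are illustrative, and the rules are copied exactly as listed in the table above.

```python
import re

# Rules copied from the "*" group in the table above.
RULES = [
    ("allow", "/.css"),
    ("allow", "/.js"),
    ("disallow", "/resultat-recherche*"),
    ("disallow", "/articles?*"),
    ("disallow", "*utm"),
    ("disallow", "/medias*"),
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal text; each "*" becomes ".*".
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path: str, rules=RULES) -> bool:
    """Return True if the path may be crawled under the given rules."""
    best_len = -1
    allowed = True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        # re.match anchors at the start, giving prefix-style matching.
        if pattern_to_regex(pattern).match(path):
            is_allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len = len(pattern)
                allowed = is_allow
    return allowed


for p in ["/", "/articles", "/articles?page=2",
          "/resultat-recherche?q=x", "/page?utm_source=x", "/medias/logo.png"]:
    print(p, is_allowed(p))
```

Under these assumptions, `/articles` stays crawlable while `/articles?page=2` does not, because the Disallow pattern requires the literal `?`. Any path containing `utm` is blocked by the `*utm` rule.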