cg18.fr
robots.txt
Robots Exclusion Standard data for cg18.fr
Resource Scan
Scan Details
Site Domain | cg18.fr |
Base Domain | cg18.fr |
Scan Status | Ok |
Last Scan | 2025-04-06T04:27:07+00:00 |
Next Scan | 2025-05-06T04:27:07+00:00 |
Last Scan
Scanned | 2025-04-06T04:27:07+00:00 |
URL | https://cg18.fr/robots.txt |
Redirect | https://www.departement18.fr/robots.txt |
Redirect Domain | www.departement18.fr |
Redirect Base | departement18.fr |
Domain IPs | 213.182.41.202 |
Redirect IPs | 87.98.187.73 |
Response IP | 87.98.187.73 |
Found | Yes |
Hash | 611e7395ed5a9396164960b9155ca3000008a65f3dd6708b5ac5959234733742 |
SimHash | 6f029e708f31 |
Groups
*
Rule | Path |
---|---|
Allow | /local/cache-css/ |
Allow | /local/cache-js/ |
Disallow | /ecrire/ |
Disallow | /lib/ |
Disallow | /prive/ |
Disallow | /spip.php?action=* |
Disallow | /spip.php?page=login* |
Disallow | /*.api/ |
Other Records
Field | Value |
---|---|
crawl-delay | 1 |
Other Records
Field | Value |
---|---|
sitemap | https://www.departement18.fr/sitemap.xml |
Warnings
- `noindex` is not a known field.
Comments