cg18.fr
robots.txt

Robots Exclusion Standard data for cg18.fr

Resource Scan

Scan Details

Site Domain cg18.fr
Base Domain cg18.fr
Scan Status Ok
Last Scan2025-04-06T04:27:07+00:00
Next Scan 2025-05-06T04:27:07+00:00

Last Scan

Scanned2025-04-06T04:27:07+00:00
URL https://cg18.fr/robots.txt
Redirect https://www.departement18.fr/robots.txt
Redirect Domain www.departement18.fr
Redirect Base departement18.fr
Domain IPs 213.182.41.202
Redirect IPs 87.98.187.73
Response IP 87.98.187.73
Found Yes
Hash 611e7395ed5a9396164960b9155ca3000008a65f3dd6708b5ac5959234733742
SimHash 6f029e708f31

Groups

*

Rule Path
Allow /local/cache-css/
Allow /local/cache-js/
Disallow /ecrire/
Disallow /lib/
Disallow /prive/
Disallow /spip.php?action=*
Disallow /spip.php?page=login*
Disallow /*.api/

Other Records

Field Value
crawl-delay 1

Other Records

Field Value
sitemap https://www.departement18.fr/sitemap.xml

Comments

  • robots.txt
  • @url: https://www.departement18.fr
  • @template: plugins/auto/entravaux/v5.1.0/robots.txt.html

Warnings

  • `noindex` is not a known field.