creuse.fr
robots.txt

Robots Exclusion Standard data for creuse.fr

Resource Scan

Scan Details

Site Domain creuse.fr
Base Domain creuse.fr
Scan Status Ok
Last Scan 2025-10-20T01:36:39+00:00
Next Scan 2025-11-19T01:36:39+00:00

Last Scan

Scanned 2025-10-20T01:36:39+00:00
URL https://creuse.fr/robots.txt
Redirect http://www.creuse.fr/robots.txt
Redirect Domain www.creuse.fr
Redirect Base creuse.fr
Domain IPs 2001:41d0:1:1b00:213:186:33:18, 213.186.33.18
Redirect IPs 2001:41d0:1:1b00:213:186:33:18, 213.186.33.18
Response IP 213.186.33.18
Found Yes
Hash 920a5ecba03c016a3ef7048c7a83df8ce0588cd4433fe7f11f13197814e3f19a
SimHash 6b07de748fb1

Groups

*

Rule Path
Allow /local/cache-css/
Allow /local/cache-js/
Disallow /ecrire/
Disallow /lib/
Disallow /prive/
Disallow /spip.php?action=*
Disallow /spip.php?page=login*
Disallow /*.api/
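
To illustrate how a crawler might evaluate the rules in the `*` group above, here is a minimal sketch that assumes RFC 9309 style matching: `*` matches any run of characters, the longest matching pattern wins, and Allow wins a tie. The function names and the sample paths in the usage note are hypothetical; only the rule list comes from the scan.

```python
import re

# Rules copied from the "*" group above, in listed order.
RULES = [
    ("allow", "/local/cache-css/"),
    ("allow", "/local/cache-js/"),
    ("disallow", "/ecrire/"),
    ("disallow", "/lib/"),
    ("disallow", "/prive/"),
    ("disallow", "/spip.php?action=*"),
    ("disallow", "/spip.php?page=login*"),
    ("disallow", "/*.api/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a regex matched from the start."""
    anchored = pattern.endswith("$")  # "$" pins the pattern to the end of the path
    if anchored:
        pattern = pattern[:-1]
    # Escape literal text and turn each "*" into "match anything".
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` (path plus query string) may be crawled."""
    best_len, allowed = -1, True  # no matching rule means the path is allowed
    for verdict, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            # Longest pattern wins; on equal length, Allow takes precedence.
            if length > best_len or (length == best_len and verdict == "allow"):
                best_len, allowed = length, verdict == "allow"
    return allowed
```

Under these assumptions, `is_allowed("/local/cache-css/site.css")` is true, while `is_allowed("/spip.php?action=logout")` and `is_allowed("/foo.api/bar")` are false. Python's standard `urllib.robotparser` is not used here because it does not interpret `*` inside paths as a wildcard.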

Other Records

Field Value
crawl-delay 1

Other Records

Field Value
sitemap https://www.creuse.fr/sitemap.xml
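
The crawl-delay and sitemap records listed above can be read back with Python's standard `urllib.robotparser`. This is a sketch only: the `ROBOTS_LINES` list is a partial reconstruction from the scan, and the placement of `Crawl-delay` inside the `*` group is an assumption, since the real file's layout is not shown here.

```python
from urllib.robotparser import RobotFileParser

# Partial reconstruction of the scanned file; the real layout may differ.
ROBOTS_LINES = [
    "User-agent: *",
    "Disallow: /ecrire/",
    "Crawl-delay: 1",  # assumed to belong to the "*" group
    "",
    "Sitemap: https://www.creuse.fr/sitemap.xml",
]

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)

# Delay in seconds that the "*" group asks crawlers to wait between requests.
delay = parser.crawl_delay("*")

# List of sitemap URLs declared in the file (site_maps() needs Python 3.8+).
sitemaps = parser.site_maps()

print(delay, sitemaps)
```

A polite crawler would typically sleep for `delay` seconds between requests and use the sitemap URLs as a starting point for discovery.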

Comments

  • robots.txt
  • @url: https://www.creuse.fr
  • @generator: SPIP 4.3.9
  • @template: squelettes-dist/robots.txt.html

Warnings

  • `noindex` is not a known field.
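
A warning like the one above could be produced by a check along the following lines. This is a sketch, not the scanner's actual logic: the `KNOWN_FIELDS` set is an assumption (the RFC 9309 fields plus the widely used `sitemap` and `crawl-delay` extensions), and the `Noindex` line in `SAMPLE` is a hypothetical placeholder, since the scan does not show the original record's value.

```python
# Assumed set of recognised field names (lowercase).
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap", "crawl-delay"}


def unknown_fields(lines):
    """Return field names (lowercased, in first-seen order) not in KNOWN_FIELDS."""
    found = []
    for line in lines:
        content = line.split("#", 1)[0].strip()  # drop comments
        if ":" not in content:
            continue  # blank line or not a "field: value" record
        field = content.split(":", 1)[0].strip().lower()
        if field not in KNOWN_FIELDS and field not in found:
            found.append(field)
    return found


SAMPLE = [
    "# robots.txt",
    "User-agent: *",
    "Disallow: /ecrire/",
    "Noindex: /example/",  # hypothetical value, for illustration only
]

for name in unknown_fields(SAMPLE):
    print(f"`{name}` is not a known field.")
```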