gerpac.eu
robots.txt

Robots Exclusion Standard data for gerpac.eu

Resource Scan

Scan Details

Site Domain gerpac.eu
Base Domain gerpac.eu
Scan Status Ok
Last Scan 2025-10-07T08:41:45+00:00
Next Scan 2025-11-06T08:41:45+00:00

Last Scan

Scanned 2025-10-07T08:41:45+00:00
URL https://gerpac.eu/robots.txt
Redirect https://www.gerpac.eu/robots.txt
Redirect Domain www.gerpac.eu
Redirect Base gerpac.eu
Domain IPs 2001:41d0:1:1b00:213:186:33:17, 46.105.204.6
Redirect IPs 2001:41d0:1:1b00:213:186:33:17, 46.105.204.6
Response IP 46.105.204.6
Found Yes
Hash bb62990dbdff89c0d1e37d55db49f1b3bd53c5cd77f0a304f1e9e17bf4c6a5cc
SimHash 6b03de70b7b1

Groups

*

Rule Path
Allow /local/cache-css/
Allow /local/cache-js/
Disallow /ecrire/
Disallow /lib/
Disallow /prive/
Disallow /spip.php?action=*
Disallow /spip.php?page=login*
Disallow /*.api/
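
To illustrate how a crawler might evaluate the rules listed above, here is a minimal Python sketch. It assumes the matching behaviour described in RFC 9309: `*` matches any run of characters, the longest matching pattern wins, and Allow wins a tie. The names `RULES`, `pattern_to_regex` and `is_allowed` are hypothetical helpers introduced for this example, not part of any scanner or library.

```python
import re

# Rules copied from the table above (user-agent group "*").
RULES = [
    ("allow", "/local/cache-css/"),
    ("allow", "/local/cache-js/"),
    ("disallow", "/ecrire/"),
    ("disallow", "/lib/"),
    ("disallow", "/prive/"),
    ("disallow", "/spip.php?action=*"),
    ("disallow", "/spip.php?page=login*"),
    ("disallow", "/*.api/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally, as a prefix of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be fetched under `rules`.

    The longest matching pattern wins; on equal length, Allow wins.
    A path that matches no rule is allowed.
    """
    best = None  # (pattern length, is_allow)
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]


if __name__ == "__main__":
    for path in ("/local/cache-css/site.css", "/ecrire/",
                 "/spip.php?action=converser", "/spip.php?page=article",
                 "/foo.api/bar"):
        print(path, "allowed" if is_allowed(path) else "blocked")
```

Individual crawlers may resolve conflicts differently, so this reflects one reasonable reading of the rules rather than the behaviour of any particular bot.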

Other Records

Field Value
crawl-delay 1

Other Records

Field Value
sitemap https://www.gerpac.eu/sitemap.xml
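
The `crawl-delay` and `sitemap` records can be read with Python's standard `urllib.robotparser`. The sketch below feeds it a small hand-written excerpt. `ROBOTS_LINES` is an assumption reconstructed from the fields reported on this page, not the verbatim file, and `example-bot` is a hypothetical user agent. Note that `urllib.robotparser` does not interpret `*` wildcards inside paths, so only a plain-prefix rule is included here.

```python
from urllib.robotparser import RobotFileParser

# Assumed excerpt, rebuilt from the records listed on this page.
ROBOTS_LINES = [
    "User-agent: *",
    "Disallow: /ecrire/",
    "Crawl-delay: 1",
    "",
    "Sitemap: https://www.gerpac.eu/sitemap.xml",
]

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)

# Seconds to wait between requests for this (hypothetical) agent.
delay = parser.crawl_delay("example-bot")

# List of sitemap URLs declared in the file (Python 3.8+).
sitemaps = parser.site_maps()

# Prefix rules are honoured by can_fetch().
blocked = not parser.can_fetch("example-bot", "https://www.gerpac.eu/ecrire/")

print(delay, sitemaps, blocked)
```

A polite crawler would typically sleep for `delay` seconds between requests and start URL discovery from the listed sitemap.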

Comments

  • robots.txt
  • @url: https://www.gerpac.eu
  • @template: squelettes-dist/robots.txt.html

Warnings

  • `noindex` is not a known field.