pressegauche.org
robots.txt

Robots Exclusion Standard data for pressegauche.org

Resource Scan

Scan Details

Site Domain pressegauche.org
Base Domain pressegauche.org
Scan Status Ok
Last Scan 2025-10-23T04:18:28+00:00
Next Scan 2025-11-22T04:18:28+00:00

Last Scan

Scanned 2025-10-23T04:18:28+00:00
URL https://pressegauche.org/robots.txt
Redirect https://www.pressegauche.org/robots.txt
Redirect Domain www.pressegauche.org
Redirect Base pressegauche.org
Domain IPs 199.58.80.33
Redirect IPs 199.58.80.35
Response IP 199.58.80.35
Found Yes
Hash 1a7905d4cb7db7cbdf8ab156dcb00344bad1cd5b1bfa4bc1c1ffe9f1152c8b05
SimHash 6b07de309f91
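
The Hash value above is 64 hexadecimal digits long, which is consistent with a SHA-256 digest, although the report does not name the algorithm. The following is a minimal sketch, assuming SHA-256 over the raw response bytes, of how a later fetch could be compared against the recorded value to detect a change. The function names `content_digest` and `has_changed` are hypothetical and not part of the scanner.

```python
import hashlib

# Hash value recorded in the scan above.
RECORDED_HASH = "1a7905d4cb7db7cbdf8ab156dcb00344bad1cd5b1bfa4bc1c1ffe9f1152c8b05"


def content_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body.

    Assumption: the scanner hashes the raw bytes with SHA-256.
    The report only shows a 64-hex-digit value, not the algorithm.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """Return True if the body's digest differs from the recorded one."""
    return content_digest(body) != recorded.lower()
```

If the assumption about the algorithm or the hashed bytes is wrong, `has_changed` reports a change on every fetch, so it is worth checking once against a file known to be unchanged.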

Groups

*

Rule Path
Allow /local/cache-css/
Allow /local/cache-js/
Disallow /ecrire/
Disallow /lib/
Disallow /prive/
Disallow /spip.php?action=*
Disallow /spip.php?page=login*
Disallow /*.api/
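
Several of the paths above use the `*` wildcard, so a plain prefix comparison is not enough to evaluate them. The following is a minimal sketch of how a crawler might apply these rules, assuming the longest-match precedence described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie). The helper names `pattern_to_regex` and `is_allowed` are hypothetical.

```python
import re

# Rules for the "*" group, copied from the table above.
RULES = [
    ("allow", "/local/cache-css/"),
    ("allow", "/local/cache-js/"),
    ("disallow", "/ecrire/"),
    ("disallow", "/lib/"),
    ("disallow", "/prive/"),
    ("disallow", "/spip.php?action=*"),
    ("disallow", "/spip.php?page=login*"),
    ("disallow", "/*.api/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    "*" matches any run of characters; a trailing "$" anchors the end.
    Everything else is matched literally from the start of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` (path plus query string) may be crawled."""
    best_len, best_allow = -1, True  # no matching rule means allowed
    for verdict, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = verdict == "allow"
            # The longer pattern wins; on equal length, Allow wins.
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow
```

For example, `is_allowed("/spip.php?action=cron")` is False while `is_allowed("/spip.php?page=article")` is True, since only the `page=login` form is disallowed.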

Other Records

Field Value
crawl-delay 1

Other Records

Field Value
sitemap https://www.pressegauche.org/sitemap.xml
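
The crawl-delay and sitemap records can be read with Python's standard `urllib.robotparser`. The sketch below parses a partial, hypothetical reconstruction of the file built only from values shown in this report. The real file's exact line order and remaining lines are not known here. Note that `urllib.robotparser` does not interpret `*` wildcards inside paths, so only a plain-prefix rule is included.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical, partial reconstruction from the records in this report.
# The actual robots.txt may order or word these lines differently.
ROBOTS_LINES = [
    "User-agent: *",
    "Crawl-delay: 1",
    "Disallow: /ecrire/",
    "Sitemap: https://www.pressegauche.org/sitemap.xml",
]

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)

# Seconds to wait between requests for the "*" group.
delay = parser.crawl_delay("*")

# Sitemap URLs declared in the file (site_maps() needs Python 3.8+).
sitemaps = parser.site_maps()

# Plain-prefix rules are honoured by the standard parser.
can_fetch_admin = parser.can_fetch("*", "https://www.pressegauche.org/ecrire/")
```

A crawler honouring these records would wait `delay` seconds between requests and could seed its URL queue from the entries in `sitemaps`.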

Comments

  • robots.txt
  • @url: https://www.pressegauche.org
  • @generator: SPIP 4.3.9
  • @template: squelettes-dist/robots.txt.html

Warnings

  • `noindex` is not a known field.