gesica.org
robots.txt

Robots Exclusion Standard data for gesica.org

Resource Scan

Scan Details

Site Domain gesica.org
Base Domain gesica.org
Scan Status Ok
Last Scan2026-02-24T19:31:00+00:00
Next Scan 2026-03-26T19:31:00+00:00

Last Scan

Scanned2026-02-24T19:31:00+00:00
URL https://gesica.org/robots.txt
Redirect https://www.gesica.org/robots.txt
Redirect Domain www.gesica.org
Redirect Base gesica.org
Domain IPs 185.100.4.27
Redirect IPs 185.100.4.27
Response IP 185.100.4.27
Found Yes
Hash 6a884ce154428754918f79158e2ae50209d5cf5b7a5c7f76efed13621aab2813
SimHash e81211a28f31

Groups

*

Rule Path
Disallow */author/*
Disallow /wp-login.php
Disallow /wp-includes
Disallow /trackback
Disallow /*.php$
Disallow /*.inc$
Allow /wp-includes/js/*
Allow /wp-includes/css/*

wprocketbot

Rule Path
Allow /
Disallow /nos-guides-juridiques*?type_guide=autres
Disallow /wp-content/uploads/2025/06/GESICA_Annuaire2023.pdf

Other Records

Field Value
sitemap https://www.gesica.org/sitemap_index.xml

Comments

  • On empeche l'indexation des dossiers sensibles
  • Url avec paramètres et éléments dupliqués
  • Url spécifiques a désindexer