demarches.numerique.gouv.fr
robots.txt

Robots Exclusion Standard data for demarches.numerique.gouv.fr

Resource Scan

Scan Details

Site Domain demarches.numerique.gouv.fr
Base Domain numerique.gouv.fr
Scan Status Ok
Last Scan2025-08-13T22:27:35+00:00
Next Scan 2025-09-12T22:27:35+00:00

Last Scan

Scanned2025-08-13T22:27:35+00:00
URL https://demarches.numerique.gouv.fr/robots.txt
Domain IPs 176.31.79.200
Response IP 176.31.79.200
Found Yes
Hash bffdbe6ba63092db44efd905fe8796f59e6cd32b5b7921ad8e864c7aa2a365dd
SimHash 32c029877570

Groups

*

Rule Path
Disallow /commencer*
Disallow /rails/
Disallow /super_admins/
Disallow /manager/
Disallow /users/
Allow /users/sign_in
Allow /users/sign_up
Disallow /connexion-par-jeton/
Disallow */reset-link-sent*
Disallow /lien-envoye
Disallow /france_connect/

Comments

  • See http://www.robotstxt.org/robotstxt.html for documentation on how to use the robots.txt file
  • To ban all spiders from the entire site uncomment the next two lines: