eurekaweb.fr
robots.txt

Robots Exclusion Standard data for eurekaweb.fr

Resource Scan

Scan Details

Site Domain eurekaweb.fr
Base Domain eurekaweb.fr
Scan Status Ok
Last Scan2026-01-30T10:07:10+00:00
Next Scan 2026-02-06T10:07:10+00:00

Last Scan

Scanned2026-01-30T10:07:10+00:00
URL https://eurekaweb.fr/robots.txt
Domain IPs 213.186.33.3
Response IP 213.186.33.3
Found Yes
Hash 9fe1e8b0e4e67a23a9561fa9899e15a4a8f5bc8e804f25260a3287abec4f628a
SimHash 6d30d1c87635

Groups

*

Rule Path
Disallow /wp/wp-admin
Disallow /wp/wp-includes
Disallow /wp/wp-content/plugins
Disallow /wp/wp-content/cache
Disallow /wp/wp-content/themes
Disallow /wp/trackback
Disallow /wp/feed
Disallow /wp/comments
Disallow /wp/category/*/*
Disallow */wp/trackback
Disallow */wp/feed
Disallow */wp/comments
Disallow /wp/*.pdf$
Disallow /wp/*?*
Disallow /wp/*?
Disallow /wp/wp-login.php
Allow /wp/wp-content/uploads

googlebot

Rule Path
Disallow /wp/*.php$
Disallow /wp/*.inc$
Disallow /wp/*.gz$
Disallow /wp/*.swf$
Disallow /wp/*.wmv$
Disallow /wp/*.cgi$
Disallow /wp/*.pdf$

googlebot-image

Rule Path
Disallow
Allow /wp/*

mediapartners-google*

Rule Path
Disallow
Allow /wp/*

Other Records

Field Value
sitemap http://eurekaweb.fr/wp/sitemap_index.xml

Comments

  • On empêche l'indexation des dossiers sensibles
  • On désindexe tous les URL ayant des paramètres (duplication de contenu)
  • On désindexe la page de connexion (contenu inutile)
  • On autorise l'indexation des images
  • On empêche l'indexation des fichiers sensibles
  • Autoriser Google Image
  • Autoriser Google AdSense
  • On indique au spider le lien vers notre sitemap
  • Note : Indiquez bien le lien vers votre sitemap. Rendez-vous dans la section Sitemaps XML de WordPress SEO pour en obtenir le lien