papa-blogueur.fr
robots.txt

Robots Exclusion Standard data for papa-blogueur.fr

Resource Scan

Scan Details

Site Domain papa-blogueur.fr
Base Domain papa-blogueur.fr
Scan Status Ok
Last Scan 2024-07-02T12:51:43+00:00
Next Scan 2024-07-09T12:51:43+00:00

Last Scan

Scanned 2024-07-02T12:51:43+00:00
URL https://papa-blogueur.fr/robots.txt
Domain IPs 109.234.166.244
Response IP 109.234.166.244
Found Yes
Hash a64876c83511a814bf287124de45c7f71a0a21be04c18c30d5bbaf1598ec5c95
SimHash 4940b8d2b205
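
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm. A minimal sketch, assuming SHA-256 over the raw response body, of how a fetched copy could be compared against the reported value. The helper names `content_hash` and `matches_reported` are hypothetical.

```python
import hashlib

# Hash value copied from the scan report above.
REPORTED_HASH = "a64876c83511a814bf287124de45c7f71a0a21be04c18c30d5bbaf1598ec5c95"


def content_hash(body: bytes) -> str:
    """Return the hex SHA-256 digest of a robots.txt body.

    Assumption: the scanner hashes the raw bytes with SHA-256;
    the report itself does not say so.
    """
    return hashlib.sha256(body).hexdigest()


def matches_reported(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """Compare a locally computed digest with the reported one, ignoring case."""
    return content_hash(body) == reported.strip().lower()
```

If the digests differ, the file may have changed since the scan, or the scanner may normalise the body or use a different algorithm.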

Groups

*

Rule Path
Disallow /wp-admin
Disallow /wp-includes
Disallow /wp-content/plugins
Disallow /wp-content/cache
Disallow /trackback
Disallow /feed
Disallow /comments
Disallow /category/*/*
Disallow */trackback
Disallow */feed
Disallow */comments
Disallow /*.pdf$
Disallow /*?*
Disallow /*?
Disallow /wp-login.php
Allow /wp-content/uploads

googlebot

Rule Path
Disallow /*.php$
Disallow /*.inc$
Disallow /*.gz$
Disallow /*.pdf$

googlebot-image

Rule Path
Disallow
Allow /*

mediapartners-google*

Rule Path
Disallow
Allow /*

ahrefssiteaudit

Rule Path
Allow /*

ahrefsbot

Rule Path
Allow /*
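
The paths above use two special characters: `*` matches any run of characters, and a trailing `$` anchors the end of the URL. A minimal sketch of how the `*` group could be evaluated, assuming the longest-match precedence described in RFC 9309 (the most specific rule wins, and Allow wins a tie). The names `RULES`, `pattern_to_regex` and `is_allowed` are hypothetical, and individual crawlers may behave differently.

```python
import re

# Rules of the "*" group, copied from the table above.
RULES = [
    ("disallow", "/wp-admin"), ("disallow", "/wp-includes"),
    ("disallow", "/wp-content/plugins"), ("disallow", "/wp-content/cache"),
    ("disallow", "/trackback"), ("disallow", "/feed"),
    ("disallow", "/comments"), ("disallow", "/category/*/*"),
    ("disallow", "*/trackback"), ("disallow", "*/feed"),
    ("disallow", "*/comments"), ("disallow", "/*.pdf$"),
    ("disallow", "/*?*"), ("disallow", "/*?"),
    ("disallow", "/wp-login.php"), ("allow", "/wp-content/uploads"),
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into an anchored regex."""
    anchored = pattern.endswith("$")
    core = pattern[:-1] if anchored else pattern
    # Escape literal pieces and join them with ".*" wherever "*" appeared.
    body = ".*".join(re.escape(part) for part in core.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_allowed(rules, path: str) -> bool:
    """Apply longest-match precedence; a path with no matching rule is allowed."""
    best_len, allowed = -1, True
    for directive, pattern in rules:
        if not pattern:
            continue  # an empty path matches nothing
        if pattern_to_regex(pattern).match(path):
            is_allow = directive == "allow"
            # A longer pattern is more specific; Allow wins a tie.
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len, allowed = len(pattern), is_allow
    return allowed
```

Under these assumptions, `is_allowed(RULES, "/wp-admin/options.php")` is false and `is_allowed(RULES, "/wp-content/uploads/photo.jpg")` is true. A crawler that has its own group, such as googlebot, follows that group's rules instead of the `*` group.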

Other Records

Field Value
sitemap https://www.papa-blogueur.fr/sitemap_index.xml

Comments

  • We block indexing of sensitive directories
  • We de-index all URLs that carry parameters (duplicate content)
  • We de-index the login page (useless content)
  • We allow indexing of images
  • We block indexing of sensitive files
  • Allow Google Image
  • Allow Google AdSense
  • Allow Ahrefs
  • We point the spider to our sitemap