bookhitch.com
robots.txt

Robots Exclusion Standard data for bookhitch.com

Resource Scan

Scan Details

Site Domain bookhitch.com
Base Domain bookhitch.com
Scan Status Ok
Last Scan 2025-03-29T20:56:57+00:00
Next Scan 2025-04-28T20:56:57+00:00

Last Scan

Scanned 2025-03-29T20:56:57+00:00
URL https://bookhitch.com/robots.txt
Domain IPs 109.234.162.163
Response IP 109.234.162.163
Found Yes
Hash 98aa4c8ad7dcea3d2285d1eaccd8b11cd704103c381aed27dd2df257cbd39853
SimHash 4962f893f245
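
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest. The report does not name the algorithm or say which bytes were hashed, so the sketch below assumes SHA-256 over the raw response body. The helper names `sha256_hex` and `matches_reported_hash` are hypothetical.

```python
import hashlib

# Assumption: the report's "Hash" field is a SHA-256 digest of the raw
# robots.txt response body. The report itself does not confirm this.
REPORTED_HASH = "98aa4c8ad7dcea3d2285d1eaccd8b11cd704103c381aed27dd2df257cbd39853"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of body as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_reported_hash(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """Compare a fetched body's digest against the value from the report."""
    return sha256_hex(body) == reported.lower()
```

To use it, fetch https://bookhitch.com/robots.txt as bytes and pass the body to `matches_reported_hash`. A mismatch may only mean the file changed since the scan, or that the assumption about the algorithm is wrong.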

Groups

*

Rule Path
Disallow /wp-admin
Disallow /wp-includes
Disallow /wp-content/plugins
Disallow /wp-content/cache
Disallow /trackback
Disallow /feed
Disallow /comments
Disallow /category/*/*
Disallow */trackback
Disallow */feed
Disallow */comments
Disallow /*.pdf$
Disallow /*?*
Disallow /*?
Disallow /wp-login.php
Allow /wp-content/uploads
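
The rules in the `*` group use the `*` wildcard and the `$` end anchor, which Python's standard `urllib.robotparser` does not interpret. The following is a minimal sketch of how such rules can be evaluated, assuming the longest-match precedence described in RFC 9309 (the longest matching pattern wins, and Allow wins a tie). The names `rule_to_regex`, `is_allowed`, and `STAR_GROUP` are hypothetical, and real crawlers may differ in detail.

```python
import re

# Rules copied from the "*" group listed above.
STAR_GROUP = [
    ("disallow", "/wp-admin"),
    ("disallow", "/wp-includes"),
    ("disallow", "/wp-content/plugins"),
    ("disallow", "/wp-content/cache"),
    ("disallow", "/trackback"),
    ("disallow", "/feed"),
    ("disallow", "/comments"),
    ("disallow", "/category/*/*"),
    ("disallow", "*/trackback"),
    ("disallow", "*/feed"),
    ("disallow", "*/comments"),
    ("disallow", "/*.pdf$"),
    ("disallow", "/*?*"),
    ("disallow", "/*?"),
    ("disallow", "/wp-login.php"),
    ("allow", "/wp-content/uploads"),
]


def rule_to_regex(path: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Without '$', the pattern is a prefix match.
    """
    anchored = path.endswith("$")
    if anchored:
        path = path[:-1]
    body = ".*".join(re.escape(part) for part in path.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_allowed(url_path: str, rules=STAR_GROUP) -> bool:
    """Return True if url_path may be crawled under the given rules."""
    best = None  # (pattern length, is_allow); longer wins, Allow wins ties
    for kind, path in rules:
        if not path:
            continue  # an empty path matches nothing
        if rule_to_regex(path).match(url_path):
            candidate = (len(path), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]
```

For example, `is_allowed("/wp-admin/options.php")` is false under this sketch, while `is_allowed("/wp-content/uploads/cover.jpg")` is true. A crawler that has its own group, such as googlebot below, follows that group instead of `*`.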

googlebot

Rule Path
Disallow /*.php$
Disallow /*.inc$
Disallow /*.gz$
Disallow /*.pdf$

googlebot-image

Rule Path
Disallow
Allow /*

mediapartners-google*

Rule Path
Disallow
Allow /*

ahrefssiteaudit

Rule Path
Allow /*

ahrefsbot

Rule Path
Allow /*

Comments

  • Block indexing of sensitive directories
  • De-index all URLs with parameters (duplicate content)
  • De-index the login page (useless content)
  • Allow indexing of images
  • Block indexing of sensitive files
  • Allow Google Image
  • Allow Google AdSense
  • Allow Ahrefs