duodisplay.com
robots.txt

Robots Exclusion Standard data for duodisplay.com

Resource Scan

Scan Details

Site Domain duodisplay.com
Base Domain duodisplay.com
Scan Status Ok
Last Scan 2024-06-10T06:20:59+00:00
Next Scan 2024-07-10T06:20:59+00:00

Last Scan

Scanned 2024-06-10T06:20:59+00:00
URL https://duodisplay.com/robots.txt
Domain IPs 2001:41d0:301::23, 46.105.204.23
Response IP 46.105.204.23
Found Yes
Hash 314da85c616d48f1491b01c2b3a3c001398b497c4687ec106ecfb233dfbdedc1
SimHash 414298c1a257
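
The Hash value above is 64 hexadecimal digits, which is consistent with a SHA-256 digest of the fetched file, although the scan does not name the algorithm. Assuming SHA-256 over the raw response body (an assumption, not something the report confirms), a later fetch could be checked for changes with a sketch like this. The helper names `sha256_hex` and `has_changed` are illustrative only.

```python
import hashlib

# Hash value reported by the scan for https://duodisplay.com/robots.txt.
REPORTED_HASH = "314da85c616d48f1491b01c2b3a3c001398b497c4687ec106ecfb233dfbdedc1"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the raw bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def has_changed(body: bytes, reported_hash: str = REPORTED_HASH) -> bool:
    """True when the body's digest differs from the reported hash.

    Assumption: the scanner hashes the unmodified response body with SHA-256.
    If it normalizes line endings or uses another algorithm, this comparison
    would report a change even for an identical file.
    """
    return sha256_hex(body) != reported_hash.lower()
```

To use it, pass the bytes of a freshly downloaded robots.txt to `has_changed`; a `False` result would suggest the file is byte-for-byte identical to the scanned copy, under the assumption stated above.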

Groups

*

Rule Path
Disallow /wp-admin
Disallow /wp-includes
Disallow /wp-content/plugins
Disallow /wp-content/cache
Disallow /trackback
Disallow /feed
Disallow /comments
Disallow /category/*/*
Disallow */trackback
Disallow */feed
Disallow */comments
Disallow /*.pdf$
Disallow /wp-login.php
Allow /wp-content/uploads

googlebot

Rule Path
Disallow /*.php$
Disallow /*.inc$
Disallow /*.gz$
Disallow /*.pdf$

googlebot-image

Rule Path
Disallow
Allow /*

mediapartners-google*

Rule Path
Disallow
Allow /*
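
Several of the paths above use the `*` wildcard and the `$` end anchor, which Python's standard `urllib.robotparser` does not interpret. The following is a minimal sketch of how such rules are commonly evaluated: the longest matching pattern wins, and Allow wins a tie, as described in RFC 9309. The function names and the two rule lists are illustrative, transcribed from the groups listed above; this is not the scanner's own implementation.

```python
import re

# Rules transcribed from the scan: (is_allow, path pattern).
STAR_GROUP = [
    (False, "/wp-admin"), (False, "/wp-includes"),
    (False, "/wp-content/plugins"), (False, "/wp-content/cache"),
    (False, "/trackback"), (False, "/feed"), (False, "/comments"),
    (False, "/category/*/*"), (False, "*/trackback"), (False, "*/feed"),
    (False, "*/comments"), (False, "/*.pdf$"), (False, "/wp-login.php"),
    (True, "/wp-content/uploads"),
]
GOOGLEBOT_GROUP = [
    (False, "/*.php$"), (False, "/*.inc$"),
    (False, "/*.gz$"), (False, "/*.pdf$"),
]


def pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a regex matched from the start.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path: str, rules) -> bool:
    """Apply longest-match precedence; a path with no matching rule is allowed."""
    best_len = -1
    allowed = True
    for allow, pattern in rules:
        if not pattern:  # an empty Disallow matches nothing
            continue
        if pattern_to_regex(pattern).match(path):
            longer = len(pattern) > best_len
            tie_prefers_allow = len(pattern) == best_len and allow
            if longer or tie_prefers_allow:
                best_len = len(pattern)
                allowed = allow
    return allowed
```

For example, `is_allowed("/brochure.pdf", STAR_GROUP)` evaluates to `False` because of `/*.pdf$`, while `is_allowed("/wp-content/uploads/logo.png", STAR_GROUP)` evaluates to `True`. Under RFC 9309 a crawler that matches a named group uses only that group, so a crawler identifying as googlebot would be evaluated against `GOOGLEBOT_GROUP` alone and the `/wp-admin` rule from the `*` group would not apply to it.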

Other Records

Field Value
sitemap https://duodisplay.com/sitemap_index.xml

Comments

  • Block indexing of sensitive directories
  • De-index the login page (useless content)
  • Allow indexing of images
  • Block indexing of sensitive files
  • Allow Google Image
  • Allow Google AdSense
  • Point the spider to our sitemap