bien-et-bio.info
robots.txt

Robots Exclusion Standard data for bien-et-bio.info

Resource Scan

Scan Details

Site Domain bien-et-bio.info
Base Domain bien-et-bio.info
Scan Status Ok
Last Scan2024-09-17T05:35:46+00:00
Next Scan 2024-10-17T05:35:46+00:00

Last Scan

Scanned2024-09-17T05:35:46+00:00
URL https://bien-et-bio.info/robots.txt
Domain IPs 2001:41d0:1:1b00:213:186:33:2, 213.186.33.2
Response IP 213.186.33.2
Found Yes
Hash bd97ea99e10ab678d126d704b6e0255174c424b54091151df5df85dc23428050
SimHash 2b141d84e213

Groups

*

Rule Path
Disallow /wp-login.php
Disallow */trackback
Disallow /*/*comments
Disallow /*/feed/
Disallow /*/*respond$
Disallow /?filter_by=*
Disallow /?s=*
Disallow /cgi-bin
Disallow /*.php$
Disallow /*.inc$
Disallow /*.gz
Disallow /*.cgi

googlebot-image

Rule Path
Disallow

mediapartners-google

Rule Path
Disallow
Allow /*.png$
Allow /*.jpg$
Allow /*.gif$
Allow /wp-includes/*
Allow /wp-content/*
Allow /wp-content/themes/*
Allow /wp-content/plugins/*
Allow /wp-content/uploads/*
Allow /wp-content/cache/*
Allow /*.js$
Allow /*.css$
Allow */css/*
Allow */js/*
Allow /wp-content*.css*
Allow /wp-content*.js*
Allow /wp-includes*.css*
Allow /wp-includes*.js*

Other Records

Field Value
sitemap https://www.bien-et-bio.info/sitemap_index.xml

Comments

  • On empeche l'indexation du DC et Rss
  • On empeche l'indexation des recherches et facettes
  • On empeche l'indexation des dossiers sensibles
  • On autorise Google Image
  • On autorise Google Adsense
  • On autorise les dossiers et fichiers qui bloquent
  • Lien des sitemaps