blogdesparents.fr
robots.txt

Robots Exclusion Standard data for blogdesparents.fr

Resource Scan

Scan Details

Site Domain blogdesparents.fr
Base Domain blogdesparents.fr
Scan Status Ok
Last Scan 2024-10-01T06:57:32+00:00
Next Scan 2024-10-08T06:57:32+00:00

Last Scan

Scanned 2024-10-01T06:57:32+00:00
URL https://blogdesparents.fr/robots.txt
Domain IPs 2001:41d0:1:1b00:213:186:33:2, 213.186.33.2
Response IP 213.186.33.2
Found Yes
Hash 399661ac470ae5a8f198d5a4105de3b9c551d36dda682219cf549f39378d945b
SimHash 410859444ff1

Groups

*

Rule Path
Disallow /wp-login.php
Disallow */trackback
Disallow /*/comments
Disallow /cgi-bin
Disallow /*.php$
Disallow /*.inc$
Disallow /*.gz
Disallow /*.cgi
Allow /*css?*
Allow /*js?*
Allow /*?utm*
Allow /css/?
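
The rules above mix `*` wildcards, `$` end anchors, and literal `?` characters, so it is not always obvious which rule decides a given URL. The following is a minimal Python sketch of how a crawler might evaluate them, assuming the longest-match precedence described in RFC 9309 (the longest matching pattern wins, and Allow wins a tie). The names `RULES`, `pattern_to_regex`, and `is_allowed` are illustrative and not part of the scan data; real crawlers may differ in details.

```python
import re

# Rules for the "*" group, copied from the table above.
RULES = [
    ("disallow", "/wp-login.php"),
    ("disallow", "*/trackback"),
    ("disallow", "/*/comments"),
    ("disallow", "/cgi-bin"),
    ("disallow", "/*.php$"),
    ("disallow", "/*.inc$"),
    ("disallow", "/*.gz"),
    ("disallow", "/*.cgi"),
    ("allow", "/*css?*"),
    ("allow", "/*js?*"),
    ("allow", "/*?utm*"),
    ("allow", "/css/?"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    "*" matches any run of characters, a trailing "$" anchors the end
    of the URL, and everything else (including "?") is literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` (path plus query string) may be crawled.

    Assumption: longest matching pattern wins; on equal length,
    Allow beats Disallow. No matching rule means allowed.
    """
    best = None  # (pattern length, is_allow)
    for kind, pattern in rules:
        # re.match anchors at the start, giving prefix semantics.
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]


if __name__ == "__main__":
    for sample in ("/wp-login.php", "/style.css?ver=1", "/index.php?utm_source=x"):
        print(sample, is_allowed(sample))
```

Under these assumptions, `/index.php?utm_source=x` is crawlable because `/*.php$` no longer matches once a query string is appended, while `/wp-login.php?utm_source=x` stays blocked because `/wp-login.php` is a longer match than `/*?utm*`.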

googlebot-image

Rule Path
Allow /*

mediapartners-google*

Rule Path
Allow /*
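
A crawler applies only one group: the one naming its own product token, or the `*` group if none does. The sketch below illustrates that selection for the three groups listed above. It assumes case-insensitive matching and treats the trailing `*` in `mediapartners-google*` as ignorable, which is an assumption about crawler behaviour rather than something the scan states; `GROUP_NAMES` and `select_group` are hypothetical names.

```python
# Group names as they appear in the scan above.
GROUP_NAMES = ["*", "googlebot-image", "mediapartners-google*"]


def select_group(product_token, group_names=GROUP_NAMES):
    """Return the group name a crawler with this product token would use."""
    token = product_token.lower()
    for name in group_names:
        # Assumption: a trailing "*" on a specific group name is ignored.
        if name != "*" and name.rstrip("*").lower() == token:
            return name
    # Fall back to the catch-all group when no specific group matches.
    return "*" if "*" in group_names else None


if __name__ == "__main__":
    for agent in ("Googlebot-Image", "Mediapartners-Google", "bingbot"):
        print(agent, "->", select_group(agent))
```

Because the two Google-specific groups contain only `Allow /*`, a crawler that selects one of them is not bound by the Disallow rules in the `*` group.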

Other Records

Field Value
sitemap https://www.blogdesparents.fr/sitemap_index.xml

Comments

  • URLs I do not want indexed: login, trackbacks, comments
  • URLs allowed for bots: CSS, JS, Analytics
  • Allow Google Image
  • Allow Google AdSense