institut-kervegan.com
robots.txt

Robots Exclusion Standard data for institut-kervegan.com

Resource Scan

Scan Details

Site Domain institut-kervegan.com
Base Domain institut-kervegan.com
Scan Status Ok
Last Scan2024-09-25T17:15:04+00:00
Next Scan 2024-10-25T17:15:04+00:00

Last Scan

Scanned2024-09-25T17:15:04+00:00
URL https://institut-kervegan.com/robots.txt
Redirect https://www.institut-kervegan.com/robots.txt
Redirect Domain www.institut-kervegan.com
Redirect Base institut-kervegan.com
Domain IPs 213.186.33.24
Redirect IPs 213.186.33.24
Response IP 213.186.33.24
Found Yes
Hash dfb55c69556cbe3d62e4ef239f2b03585831a8b1fbef83417a6865bce2cf5214
SimHash 684cdfc06678

Groups

*

Rule Path
Disallow /wp-*
Disallow /cgi-bin/
Disallow */trackback
Disallow /*/feed
Disallow /*/comments
Disallow /*?
Disallow /*.php$
Disallow /*.inc$
Disallow /*.gz$
Disallow /*.swf$
Disallow /*.wmv$
Disallow /*.cgi$
Disallow /*.xhtml$
Disallow /xmlrpc.php
Allow /wp-content/uploads/
Allow /*.css?*
Allow /*.js?*
Allow /*.css
Allow /*.js
Allow /*.min.css
Allow /*.min.js
Allow /*.jpg
Allow /*.jpeg
Allow /*.gif
Allow /*.png
Allow /*.svg
Allow /*.woff?*
Allow /*.woff2?*
Allow /*.ttf?*

Comments

  • Directories
  • Paths (clean URLs)
  • Files
  • Allow images, CSS et JS
  • Google Image
  • User-agent: Googlebot-Image