touteslesbox.fr
robots.txt

Robots Exclusion Standard data for touteslesbox.fr

Resource Scan

Scan Details

Site Domain touteslesbox.fr
Base Domain touteslesbox.fr
Scan Status Ok
Last Scan2025-10-29T19:06:59+00:00
Next Scan 2025-11-05T19:06:59+00:00

Last Scan

Scanned2025-10-29T19:06:59+00:00
URL https://touteslesbox.fr/robots.txt
Domain IPs 146.88.235.94
Response IP 146.88.235.94
Found Yes
Hash 40fc9ce295b710057396a37972e9e668364ceba5adee40212d363bc6ed99a643
SimHash 2240d693d622

Groups

*

Rule Path
Disallow /cgi-bin
Allow /*.js$
Allow /*.css$

Other Records

Field Value
crawl-delay 10

googlebot

Rule Path
Allow .js
Allow .css
Allow /wp-includes
Allow /wp-content
Disallow /cgi-bin
Disallow /wp-admin
Disallow /wp-includes
Disallow /wp-content
Allow /feed-smart-food/
Disallow /feed
Disallow /*/feed
Allow /comment-lancer-sa-box-mensuelle/
Disallow /comments

alexabot
mediapartners-google
adsbot-google
googlebot-image
googlebot-mobile
ia_archiver-web.archive.org
googlebot
googlebot-image
googlebot-mobile
msnbot
slurp
teoma
twiceler
gigabot
scrubby
robozilla
nutch
ia_archiver
baiduspider
naverbot
yeti
yahoo-mmcrawler
psbot
asterias
yahoo-blogs/v3.9

No rules defined. All paths allowed.

Other Records

Field Value
sitemap https://touteslesbox.fr/sitemap_index.xml

Comments

  • Unblock All CSS & Javascript for bot
  • Unblock All CSS & Javascript for Googlebot
  • Disallow
  • Disallow: */comment-* # Blocks the Comments Permalinks and Comment Pages
  • Allow bots