cacerfarien.fr
robots.txt

Robots Exclusion Standard data for cacerfarien.fr

Resource Scan

Scan Details

Site Domain cacerfarien.fr
Base Domain cacerfarien.fr
Scan Status Ok
Last Scan 2026-02-18T20:43:21+00:00
Next Scan 2026-02-25T20:43:21+00:00

Last Scan

Scanned 2026-02-18T20:43:21+00:00
URL https://cacerfarien.fr/robots.txt
Domain IPs 104.21.35.141, 172.67.175.97, 2606:4700:3032::6815:238d, 2606:4700:3034::ac43:af61
Response IP 104.21.35.141
Found Yes
Hash c2988950276270257ccb34489f29656d196f26af9c828a96c2512d63feefb563
SimHash e830437bcf99

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /wp-content/plugins/
Disallow /wp-content/themes/
Disallow /wp-content/cache/
Disallow /cgi-bin/
Disallow /trackback/
Disallow /xmlrpc.php
Disallow /feed/
Disallow /comments/
Disallow /category/
Disallow /tag/
Disallow /author/
Disallow /search/
Disallow /?s=
Allow /wp-content/uploads/
Disallow /*?replytocom
Disallow /*?orderby=
Disallow /*?filter_
Disallow /*.js$
Disallow /*.css$
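
The rules above mix plain path prefixes with wildcard patterns (`*` and a trailing `$`), so it can be hard to tell by eye which rule applies to a given URL. The following is a minimal sketch of how a crawler might evaluate them, assuming the longest-match precedence described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie). The names `RULES`, `pattern_to_regex`, and `is_allowed` are illustrative and not part of the scan data, and real crawlers may differ in details.

```python
import re

# Rules copied from the "*" group above, as (directive, path pattern) pairs.
RULES = [
    ("Disallow", "/wp-admin/"),
    ("Disallow", "/wp-includes/"),
    ("Disallow", "/wp-content/plugins/"),
    ("Disallow", "/wp-content/themes/"),
    ("Disallow", "/wp-content/cache/"),
    ("Disallow", "/cgi-bin/"),
    ("Disallow", "/trackback/"),
    ("Disallow", "/xmlrpc.php"),
    ("Disallow", "/feed/"),
    ("Disallow", "/comments/"),
    ("Disallow", "/category/"),
    ("Disallow", "/tag/"),
    ("Disallow", "/author/"),
    ("Disallow", "/search/"),
    ("Disallow", "/?s="),
    ("Allow", "/wp-content/uploads/"),
    ("Disallow", "/*?replytocom"),
    ("Disallow", "/*?orderby="),
    ("Disallow", "/*?filter_"),
    ("Disallow", "/*.js$"),
    ("Disallow", "/*.css$"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end
    of the URL. Everything else is matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` (path plus query string) may be crawled.

    Assumed precedence: the longest matching pattern wins, and Allow
    wins when an Allow and a Disallow pattern have equal length.
    """
    best_len = -1
    allowed = True  # a path matched by no rule is allowed
    for directive, pattern in rules:
        # re.match anchors at the start of the path, as robots.txt requires.
        if pattern_to_regex(pattern).match(path):
            is_allow = directive == "Allow"
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len, allowed = len(pattern), is_allow
    return allowed
```

Under these assumptions, `is_allowed("/wp-content/uploads/app.js")` is true because the Allow pattern is longer than `/*.js$`, while `is_allowed("/assets/app.js?v=2")` is also true because the `$` anchor means `/*.js$` only matches URLs that end in `.js`.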

Other Records

Field Value
sitemap https://cacerfarien.fr/sitemap_index.xml

Comments

  • Allow access to specific files in wp-content
  • Block specific URL parameters that cause duplicate content
  • Block unnecessary scripts and styles
  • Sitemap