hcfcu.com
robots.txt

Robots Exclusion Standard data for hcfcu.com

Resource Scan

Scan Details

Site Domain hcfcu.com
Base Domain hcfcu.com
Scan Status Ok
Last Scan2025-06-25T13:55:46+00:00
Next Scan 2025-07-25T13:55:46+00:00

Last Scan

Scanned2025-06-25T13:55:46+00:00
URL https://hcfcu.com/robots.txt
Domain IPs 216.206.109.129
Response IP 216.206.109.129
Found Yes
Hash 1f675d579eb14c9320280403c57f293a1d938bca1573bd711be16e44de594756
SimHash 7954b35357bd

Groups

*

Rule Path
Disallow /category/
Disallow /*/trackback/$
Disallow /plesk-stat/$
Disallow /*/feed/$
Allow /feed/$

googlebot-image

Rule Path
Allow /*

ia_archiver

Rule Path
Disallow /*

duggmirror

Rule Path
Disallow /*

Comments

  • disallow all files in these directories
  • Disallow: /wp-*
  • Disallow: /contact/
  • I’m not interested in being found by who I am, only by what I post.
  • Disallow: /about/
  • Disallow: /*?*
  • Disallow: /mint$
  • Disallow: /feeder/$
  • allow google image bot to search all images
  • disallow archiving site
  • disable duggmirror