bhsg.com
robots.txt

Robots Exclusion Standard data for bhsg.com

Resource Scan

Scan Details

Site Domain bhsg.com
Base Domain bhsg.com
Scan Status Ok
Last Scan2025-12-22T04:30:44+00:00
Next Scan 2025-12-29T04:30:44+00:00

Last Scan

Scanned2025-12-22T04:30:44+00:00
URL https://bhsg.com/robots.txt
Domain IPs 104.26.2.17, 104.26.3.17, 172.67.71.253, 2606:4700:20::681a:211, 2606:4700:20::681a:311, 2606:4700:20::ac43:47fd
Response IP 172.67.71.253
Found Yes
Hash f0e15c01daf4db3dbdeed8e4964b631f9e2bc3a16ecee000ffc68a0c08ce7fb9
SimHash c25a1c723637

Groups

*

Rule Path
Disallow /cpresources/
Disallow /vendor/
Disallow /.env
Disallow /cache/

bytespider

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

timpibot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://bhsg.com/sitemap.xml

Comments

  • robots.txt for https://bhsg.com/
  • live - don't allow web crawlers to index cpresources/ or vendor/