ihst.org
robots.txt

Robots Exclusion Standard data for ihst.org

Resource Scan

Scan Details

Site Domain ihst.org
Base Domain ihst.org
Scan Status Ok
Last Scan2025-06-06T07:03:21+00:00
Next Scan 2025-07-06T07:03:21+00:00

Last Scan

Scanned2025-06-06T07:03:21+00:00
URL https://ihst.org/robots.txt
Redirect https://crfh.net/robots.txt
Redirect Domain crfh.net
Redirect Base crfh.net
Domain IPs 104.21.22.50, 172.67.202.241, 2606:4700:3033::ac43:caf1, 2606:4700:3034::6815:1632
Redirect IPs 104.21.112.1, 104.21.16.1, 104.21.32.1, 104.21.48.1, 104.21.64.1, 104.21.80.1, 104.21.96.1, 2606:4700:3030::6815:1001, 2606:4700:3030::6815:2001, 2606:4700:3030::6815:3001, 2606:4700:3030::6815:4001, 2606:4700:3030::6815:5001, 2606:4700:3030::6815:6001, 2606:4700:3030::6815:7001
Response IP 104.21.16.1
Found Yes
Hash 84e1bd90e1c95a62cc6067c95aae3e36bfb69ff9b25b58d9ddfd2137313246df
SimHash 4bc4d81443f2

Groups

*

Rule Path
Allow /
Disallow /wp-admin/
Disallow /*/trackback
Disallow /img/
Disallow /tag/
Disallow /feed
Disallow /*/feed
Disallow /?s=*
Disallow /?link=*
Disallow /attachment/
Disallow /author/
Disallow /page/*
Disallow /*?utm_source
Disallow /*%26utm_source

Other Records

Field Value
sitemap https://crfh.net/sitemap.xml