guidehorse.com
robots.txt

Robots Exclusion Standard data for guidehorse.com

Resource Scan

Scan Details

Site Domain guidehorse.com
Base Domain guidehorse.com
Scan Status Ok
Last Scan2025-10-18T19:58:08+00:00
Next Scan 2025-11-17T19:58:08+00:00

Last Scan

Scanned2025-10-18T19:58:08+00:00
URL https://guidehorse.com/robots.txt
Redirect https://www.guidehorse.com/robots.txt
Redirect Domain www.guidehorse.com
Redirect Base guidehorse.com
Domain IPs 104.21.14.44, 172.67.157.187, 2606:4700:3033::6815:e2c, 2606:4700:3036::ac43:9dbb
Redirect IPs 104.21.14.44, 172.67.157.187, 2606:4700:3033::6815:e2c, 2606:4700:3036::ac43:9dbb
Response IP 104.21.14.44
Found Yes
Hash b6946c460790b2e01127b6ebbc9b01a1b0d4ad46440c408b21ef12c2f5982e00
SimHash 6857dd42ae44

Groups

*

Rule Path
Disallow *?utm_content*
Disallow /wp-content/plugins
Disallow /wp-content/cache
Disallow /tag
Disallow /cgi-bin
Disallow /wp-admin
Disallow /wp-includes
Disallow /attachment
Disallow /?feed=comments-rss2
Disallow /?feed=rss
Disallow /author
Disallow /?feed=atom
Disallow /feed
Disallow /wp-content/themes
Disallow /wp-register.php
Disallow /wp-login.php
Disallow /trackback
Disallow /xmlrpc.php?rsd
Disallow /?s=*
Disallow /*?*
Disallow /page
Disallow /*?
Disallow /*?utm_source*
Disallow *?utm_medium*
Disallow /?feed=rss2
Disallow /xmlrpc.php