trailheadcu.org
robots.txt

Robots Exclusion Standard data for trailheadcu.org

Resource Scan

Scan Details

Site Domain trailheadcu.org
Base Domain trailheadcu.org
Scan Status Ok
Last Scan 2025-12-21T09:35:49+00:00
Next Scan 2026-01-20T09:35:49+00:00

Last Scan

Scanned 2025-12-21T09:35:49+00:00
URL https://trailheadcu.org/robots.txt
Domain IPs 104.26.8.41, 104.26.9.41, 172.67.71.112, 2606:4700:20::681a:829, 2606:4700:20::681a:929, 2606:4700:20::ac43:4770
Response IP 104.26.8.41
Found Yes
Hash 3addd0c2a6711b26ecc5b554e3ac9549ffe4e02a5edaa175aff91d76d1f2c6fd
SimHash c8655e80a942

Groups

*

Rule Path
Disallow /wp-
Disallow /search
Disallow /feed
Disallow /comments/feed
Disallow /feed/$
Disallow /*/feed/$
Disallow /*/feed/rss/$
Disallow /*/trackback/$
Disallow /*/*/feed/$
Disallow /*/*/feed/rss/$
Disallow /*/*/trackback/$
Disallow /*/*/*/feed/$
Disallow /*/*/*/feed/rss/$
Disallow /*/*/*/trackback/$
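
The rules above use two special characters: `*`, which matches any sequence of characters, and a trailing `$`, which anchors the pattern to the end of the URL path. Patterns without `$` are prefix matches. The following is a minimal sketch of that matching logic, assuming the wildcard semantics described in RFC 9309; it is not the scanner's or any crawler's actual implementation, and the function names `pattern_to_regex` and `is_disallowed` are hypothetical.

```python
import re

# Disallow patterns copied from the "*" group listed above.
DISALLOW = [
    "/wp-", "/search", "/feed", "/comments/feed", "/feed/$",
    "/*/feed/$", "/*/feed/rss/$", "/*/trackback/$",
    "/*/*/feed/$", "/*/*/feed/rss/$", "/*/*/trackback/$",
    "/*/*/*/feed/$", "/*/*/*/feed/rss/$", "/*/*/*/trackback/$",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate one robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    # Escape the literal pieces and join them with ".*" for each "*".
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    # "\Z" pins the match to the very end of the path when "$" was present.
    return re.compile(regex + (r"\Z" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow pattern matches from the start of the path."""
    # re.match anchors at the beginning, which gives prefix semantics.
    return any(pattern_to_regex(p).match(path) for p in DISALLOW)


if __name__ == "__main__":
    for path in ["/wp-admin/", "/search?q=loans", "/blog/feed/",
                 "/blog/feed/atom/", "/about/"]:
        print(path, is_disallowed(path))
```

Under this interpretation `*` also matches `/`, so `/*/feed/$` already covers paths such as `/a/b/c/feed/`, and the deeper `/*/*/...` variants would be redundant. Crawlers that treat wildcards differently may not behave this way.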