trailheadcu.org
robots.txt
Robots Exclusion Standard data for trailheadcu.org
Resource Scan
Scan Details
| Site Domain | trailheadcu.org |
| Base Domain | trailheadcu.org |
| Scan Status | Ok |
| Last Scan | 2025-12-21T09:35:49+00:00 |
| Next Scan | 2026-01-20T09:35:49+00:00 |
Last Scan
| Scanned | 2025-12-21T09:35:49+00:00 |
| URL | https://trailheadcu.org/robots.txt |
| Domain IPs | 104.26.8.41, 104.26.9.41, 172.67.71.112, 2606:4700:20::681a:829, 2606:4700:20::681a:929, 2606:4700:20::ac43:4770 |
| Response IP | 104.26.8.41 |
| Found | Yes |
| Hash | 3addd0c2a6711b26ecc5b554e3ac9549ffe4e02a5edaa175aff91d76d1f2c6fd |
| SimHash | c8655e80a942 |
Groups
*
| Rule | Path |
|---|---|
| Disallow | /wp- |
| Disallow | /search |
| Disallow | /feed |
| Disallow | /comments/feed |
| Disallow | /feed/$ |
| Disallow | /*/feed/$ |
| Disallow | /*/feed/rss/$ |
| Disallow | /*/trackback/$ |
| Disallow | /*/*/feed/$ |
| Disallow | /*/*/feed/rss/$ |
| Disallow | /*/*/trackback/$ |
| Disallow | /*/*/*/feed/$ |
| Disallow | /*/*/*/feed/rss/$ |
| Disallow | /*/*/*/trackback/$ |