getclue.com
robots.txt
Robots Exclusion Standard data for getclue.com
Resource Scan
Scan Details
| Field | Value |
|---|---|
| Site Domain | getclue.com |
| Base Domain | getclue.com |
| Scan Status | Ok |
| Last Scan | 2025-10-22T21:26:20+00:00 |
| Next Scan | 2025-11-21T21:26:20+00:00 |
Last Scan
| Field | Value |
|---|---|
| Scanned | 2025-10-22T21:26:20+00:00 |
| URL | https://getclue.com/robots.txt |
| Redirect | https://www.getclue.com/robots.txt |
| Redirect Domain | www.getclue.com |
| Redirect Base | getclue.com |
| Domain IPs | 104.21.39.199, 172.67.171.88, 2606:4700:3031::6815:27c7, 2606:4700:3034::ac43:ab58 |
| Redirect IPs | 198.202.211.1, 2620:cb:2000::1 |
| Response IP | 198.202.211.1 |
| Found | Yes |
| Hash | 65679baa62780bf1a11136ef4f8591ad33c3e3ab8a0959a6978c8e728b743b18 |
| SimHash | a120b9618899 |
Groups
User-agent: *
| Rule | Path |
|---|---|
| Disallow | /cgi-bin |
| Disallow | /?* |
| Disallow | *?s= |
| Disallow | *%26s%3D |
| Disallow | /search/ |
| Disallow | /author/ |
| Disallow | /users/ |
| Disallow | */trackback |
| Disallow | */embed |
| Disallow | *utm*%3D |
| Disallow | *openstat%3D |
| Disallow | /*.php |
| Allow | */uploads |
| Allow | /*/*.js |
| Allow | /*/*.css |
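
The rules above mix plain prefixes with `*` wildcards, so it is not always obvious which rule decides a given URL. The following is a minimal sketch of one way to evaluate them, assuming the matching semantics described in RFC 9309: `*` matches any run of characters, a trailing `$` anchors the end, the longest matching pattern wins, and `Allow` wins a tie. The helper names `pattern_to_regex` and `is_allowed` are hypothetical, the paths are compared as given without percent-encoding normalization, and individual crawlers may treat patterns that begin with `*` differently.

```python
import re

# Rules copied from the "*" group in the table above.
RULES = [
    ("disallow", "/cgi-bin"),
    ("disallow", "/?*"),
    ("disallow", "*?s="),
    ("disallow", "*%26s%3D"),
    ("disallow", "/search/"),
    ("disallow", "/author/"),
    ("disallow", "/users/"),
    ("disallow", "*/trackback"),
    ("disallow", "*/embed"),
    ("disallow", "*utm*%3D"),
    ("disallow", "*openstat%3D"),
    ("disallow", "/*.php"),
    ("allow", "*/uploads"),
    ("allow", "/*/*.js"),
    ("allow", "/*/*.css"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with ".*" for each "*" wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`."""
    best = None  # (pattern length, is_allow); tuple order breaks ties for allow
    for kind, pattern in rules:
        # re.match anchors at the start, giving prefix-style matching.
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    # No matching rule means the path is allowed by default.
    return True if best is None else best[1]


if __name__ == "__main__":
    # Hypothetical example paths, not taken from the scan.
    for path in ["/blog/post", "/search/cycle", "/index.php",
                 "/wp-content/uploads/file.php"]:
        print(path, is_allowed(path))
```

Under these assumptions, a hypothetical path such as `/wp-content/uploads/file.php` comes out as allowed, because `*/uploads` is a longer pattern than `/*.php` and therefore takes precedence.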
Other Records
| Field | Value |
|---|---|
| sitemap | https://www.getclue.com/sitemap.xml |
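
As a rough illustration of how the sitemap record can be read programmatically, the sketch below feeds an abbreviated reconstruction of the file to Python's standard `urllib.robotparser` and asks for its sitemap entries. The `ROBOTS_TXT` text is rebuilt from the tables in this report and is an assumption, not the fetched file; `site_maps()` requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Abbreviated reconstruction based on this report, not the original file.
ROBOTS_TXT = """\
User-agent: *
Disallow: /cgi-bin
Disallow: /search/
Sitemap: https://www.getclue.com/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# site_maps() returns None when no Sitemap record is present.
sitemaps = parser.site_maps() or []
print(sitemaps)
```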