getclue.com
robots.txt

Robots Exclusion Standard data for getclue.com

Resource Scan

Scan Details

Site Domain getclue.com
Base Domain getclue.com
Scan Status Ok
Last Scan 2025-10-22T21:26:20+00:00
Next Scan 2025-11-21T21:26:20+00:00

Last Scan

Scanned 2025-10-22T21:26:20+00:00
URL https://getclue.com/robots.txt
Redirect https://www.getclue.com/robots.txt
Redirect Domain www.getclue.com
Redirect Base getclue.com
Domain IPs 104.21.39.199, 172.67.171.88, 2606:4700:3031::6815:27c7, 2606:4700:3034::ac43:ab58
Redirect IPs 198.202.211.1, 2620:cb:2000::1
Response IP 198.202.211.1
Found Yes
Hash 65679baa62780bf1a11136ef4f8591ad33c3e3ab8a0959a6978c8e728b743b18
SimHash a120b9618899

Groups

*

Rule Path
Disallow /cgi-bin
Disallow /?*
Disallow *?s=
Disallow *%26s%3D
Disallow /search/
Disallow /author/
Disallow /users/
Disallow */trackback
Disallow */embed
Disallow *utm*%3D
Disallow *openstat%3D
Disallow /*.php
Allow */uploads
Allow /*/*.js
Allow /*/*.css
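
The rules above mix plain prefixes with `*` wildcards, so it can help to see how a crawler might evaluate them. The following is a minimal sketch, assuming the matching behaviour described in RFC 9309: `*` matches any run of characters, a trailing `$` anchors the end of the path, the longest matching pattern wins, and Allow wins a tie. The helper names `pattern_to_regex` and `is_allowed` and the sample paths are hypothetical, and percent-encoding normalisation is deliberately left out.

```python
import re

# Rules copied from the "*" group listed above, in the same order.
RULES = [
    ("disallow", "/cgi-bin"),
    ("disallow", "/?*"),
    ("disallow", "*?s="),
    ("disallow", "*%26s%3D"),
    ("disallow", "/search/"),
    ("disallow", "/author/"),
    ("disallow", "/users/"),
    ("disallow", "*/trackback"),
    ("disallow", "*/embed"),
    ("disallow", "*utm*%3D"),
    ("disallow", "*openstat%3D"),
    ("disallow", "/*.php"),
    ("allow", "*/uploads"),
    ("allow", "/*/*.js"),
    ("allow", "/*/*.css"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with ".*" where "*" appeared.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if the path may be crawled under the given rules."""
    best_len = -1
    verdict = True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            # Longest pattern wins; on equal length, prefer Allow.
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len = len(pattern)
                verdict = allow
    return verdict


print(is_allowed("/search/period-tracking"))     # False
print(is_allowed("/index.php"))                  # False
print(is_allowed("/assets/uploads/report.php"))  # True
print(is_allowed("/assets/js/app.js"))           # True
```

In the third sample, `*/uploads` (9 characters) outranks `/*.php` (6 characters), which is why an upload path ending in `.php` would still be crawlable under this reading. Individual crawlers may resolve such conflicts differently.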

Other Records

Field Value
sitemap https://www.getclue.com/sitemap.xml
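
For reference, the group and the sitemap record above can be rendered back into robots.txt syntax. This is only a sketch: the original file's exact line order, capitalisation, blank lines, and comments are not shown in the scan, so the output is an approximation that would not be expected to reproduce the Hash value listed earlier. The function name `render_robots_txt` is hypothetical.

```python
# Values copied from the Groups and Other Records listings above.
GROUP_AGENT = "*"
RULES = [
    ("Disallow", "/cgi-bin"),
    ("Disallow", "/?*"),
    ("Disallow", "*?s="),
    ("Disallow", "*%26s%3D"),
    ("Disallow", "/search/"),
    ("Disallow", "/author/"),
    ("Disallow", "/users/"),
    ("Disallow", "*/trackback"),
    ("Disallow", "*/embed"),
    ("Disallow", "*utm*%3D"),
    ("Disallow", "*openstat%3D"),
    ("Disallow", "/*.php"),
    ("Allow", "*/uploads"),
    ("Allow", "/*/*.js"),
    ("Allow", "/*/*.css"),
]
SITEMAP = "https://www.getclue.com/sitemap.xml"


def render_robots_txt(agent, rules, sitemap):
    """Build an approximate robots.txt body from scanned records."""
    lines = [f"User-agent: {agent}"]
    lines += [f"{kind}: {path}" for kind, path in rules]
    # A blank line separates the group from the sitemap record.
    lines += ["", f"Sitemap: {sitemap}"]
    return "\n".join(lines) + "\n"


print(render_robots_txt(GROUP_AGENT, RULES, SITEMAP))
```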