my.guideline.com
robots.txt

Robots Exclusion Standard data for my.guideline.com

Resource Scan

Scan Details

Site Domain my.guideline.com
Base Domain guideline.com
Scan Status Ok
Last Scan2025-08-25T09:08:01+00:00
Next Scan 2025-09-08T09:08:01+00:00

Last Scan

Scanned2025-08-25T09:08:01+00:00
URL https://my.guideline.com/robots.txt
Domain IPs 172.66.40.187, 172.66.43.69, 2606:4700:3108::ac42:28bb, 2606:4700:3108::ac42:2b45
Response IP 172.66.43.69
Found Yes
Hash d10dfc1f3f3c79feb711113d1b2dbf2ca00ba42cad55231ab08fffcf33c64bfa
SimHash 89edc3602101

Groups

*

Rule Path
Allow /$
Allow /.well-known
Allow /advisor/
Allow /agreements/
Allow /assets
Allow /connect_with_payroll
Allow /explore/
Allow /get-started
Allow /ira/
Allow /login
Allow /participant/
Allow /passwords/
Allow /savers/
Allow /sitemap.xml
Allow /sponsor/dashboard/
Allow /pipeline/
Allow /sep/
Disallow /
Disallow /cdn-cgi/

Other Records

Field Value
sitemap https://my.guideline.com/sitemap.xml