readthegeneralist.com
robots.txt

Robots Exclusion Standard data for readthegeneralist.com

Resource Scan

Scan Details

Site Domain readthegeneralist.com
Base Domain readthegeneralist.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2025-09-19T16:10:42+00:00
Next Scan 2025-11-18T16:10:42+00:00
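
The gap between the last and next scan can be checked directly from the two timestamps above. The following is a minimal sketch; the variable names are illustrative, and reading the result as a fixed rescan interval is an assumption, since the report does not state its scheduling policy.

```python
from datetime import datetime

# Timestamps copied verbatim from the Scan Details fields above.
last_scan = datetime.fromisoformat("2025-09-19T16:10:42+00:00")
next_scan = datetime.fromisoformat("2025-11-18T16:10:42+00:00")

# Both values carry a UTC offset, so the subtraction is timezone-aware.
interval = next_scan - last_scan
print(f"Days until next scan: {interval.days}")
```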

Last Successful Scan

Scanned 2025-06-29T14:42:52+00:00
URL https://readthegeneralist.com/robots.txt
Redirect https://www.generalist.com/robots.txt
Redirect Domain www.generalist.com
Redirect Base generalist.com
Domain IPs 75.2.70.75, 99.83.190.102
Redirect IPs 104.26.6.229, 104.26.7.229, 172.67.71.156, 2606:4700:20::681a:6e5, 2606:4700:20::681a:7e5, 2606:4700:20::ac43:479c
Response IP 104.26.7.229
Found Yes
Hash 171d9ae10726e78c154ba2cdd1f77cbd0b0fb6c33a957bf5167ec13b09915939
SimHash b0389840d555
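
As a quick consistency check on the address fields above, the sketch below uses Python's standard `ipaddress` module to split the redirect IPs by protocol version and to confirm that the response IP is one of them. The variable names are illustrative and not part of the report.

```python
import ipaddress

# Redirect IPs and Response IP copied from the Last Successful Scan fields.
redirect_ips = [
    "104.26.6.229",
    "104.26.7.229",
    "172.67.71.156",
    "2606:4700:20::681a:6e5",
    "2606:4700:20::681a:7e5",
    "2606:4700:20::ac43:479c",
]
response_ip = ipaddress.ip_address("104.26.7.229")

parsed = [ipaddress.ip_address(ip) for ip in redirect_ips]
ipv4 = [ip for ip in parsed if ip.version == 4]
ipv6 = [ip for ip in parsed if ip.version == 6]

# True when the address that served the response is one of the redirect IPs.
response_in_redirect_set = response_ip in parsed
print(len(ipv4), len(ipv6), response_in_redirect_set)
```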

Groups

blexbot

Rule Path
Disallow /

twitterbot

Rule Path
Disallow

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

*

Rule Path
Disallow /action/
Disallow /publish
Disallow /sign-in
Disallow /channel-frame
Disallow /visited-surface-frame
Disallow /feed/private
Disallow /feed/podcast/*/private/*.rss
Disallow /subscribe
Disallow /lovestack/*
Disallow /p/*/comment/*
Disallow /inbox/post/*
Disallow /notes/post/*
Disallow /embed

facebookexternalhit

Rule Path
Allow /
Allow /subscribe

Other Records

Field Value
sitemap https://www.generalist.com/sitemap.xml
sitemap https://www.generalist.com/news_sitemap.xml
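
The groups and records above can be written back out as a robots.txt file and queried with Python's standard `urllib.robotparser`. This is a sketch under stated assumptions: the file text is reconstructed from this report rather than fetched from the server, so spacing and letter case may differ from the original, and the test URLs (such as `/p/some-post`) are hypothetical. Note that `urllib.robotparser` matches rules by plain path prefix in file order and does not expand `*` wildcards inside paths, so rules such as `/p/*/comment/*` are not evaluated as patterns here. `site_maps()` requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the Groups and Other Records sections of this report.
ROBOTS_TXT = """\
User-agent: blexbot
Disallow: /

User-agent: twitterbot
Disallow:

User-agent: gptbot
Disallow: /

User-agent: google-extended
Disallow: /

User-agent: *
Disallow: /action/
Disallow: /publish
Disallow: /sign-in
Disallow: /channel-frame
Disallow: /visited-surface-frame
Disallow: /feed/private
Disallow: /feed/podcast/*/private/*.rss
Disallow: /subscribe
Disallow: /lovestack/*
Disallow: /p/*/comment/*
Disallow: /inbox/post/*
Disallow: /notes/post/*
Disallow: /embed

User-agent: facebookexternalhit
Allow: /
Allow: /subscribe

Sitemap: https://www.generalist.com/sitemap.xml
Sitemap: https://www.generalist.com/news_sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

BASE = "https://www.generalist.com"  # redirect target recorded in the scan

# gptbot is blocked from the whole site.
gptbot_post = parser.can_fetch("GPTBot", BASE + "/p/some-post")
# An empty Disallow value places no restriction on twitterbot.
twitterbot_subscribe = parser.can_fetch("Twitterbot", BASE + "/subscribe")
# Agents without their own group fall back to the * group.
other_subscribe = parser.can_fetch("SomeOtherBot", BASE + "/subscribe")
other_post = parser.can_fetch("SomeOtherBot", BASE + "/p/some-post")
# facebookexternalhit is explicitly allowed everywhere.
facebook_subscribe = parser.can_fetch("facebookexternalhit", BASE + "/subscribe")

sitemaps = parser.site_maps()
print(gptbot_post, twitterbot_subscribe, other_subscribe, other_post,
      facebook_subscribe, sitemaps)
```

The empty `Disallow` in the twitterbot group is what lets that crawler fetch `/subscribe` even though the `*` group blocks it: a crawler that matches a named group follows that group and ignores `*`.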