thrivingguide.com
robots.txt

Robots Exclusion Standard data for thrivingguide.com

Resource Scan

Scan Details

Site Domain thrivingguide.com
Base Domain thrivingguide.com
Scan Status Ok
Last Scan2026-03-05T09:20:52+00:00
Next Scan 2026-03-12T09:20:52+00:00

Last Scan

Scanned2026-03-05T09:20:52+00:00
URL https://thrivingguide.com/robots.txt
Domain IPs 104.16.243.55
Response IP 104.16.243.55
Found Yes
Hash 90a5c0122c854f7937ed449c1d60a8b9a386b35c372917c6cd5f825612bf3ce3
SimHash 6b1ddcf2ef01

Groups

amazonbot

Rule Path
Disallow /

googlebot

Rule Path
Disallow /nogooglebot/

*

Rule Path
Disallow /login

adsbot-google

Rule Path
Disallow /login

nutch

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /login

Other Records

Field Value
crawl-delay 10

ahrefssiteaudit

Rule Path
Disallow /login

Other Records

Field Value
crawl-delay 10

mj12bot

Rule Path
Disallow /login

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://thrivingguide.com/sitemap.xml