readtheblueprint.com
robots.txt

Robots Exclusion Standard data for readtheblueprint.com

Resource Scan

Scan Details

Site Domain readtheblueprint.com
Base Domain readtheblueprint.com
Scan Status Ok
Last Scan2026-02-08T04:08:33+00:00
Next Scan 2026-03-10T04:08:33+00:00

Last Scan

Scanned2026-02-08T04:08:33+00:00
URL https://www.readtheblueprint.com/robots.txt
Domain IPs 104.18.68.40, 104.18.69.40, 2606:4700::6812:4428, 2606:4700::6812:4528
Response IP 104.18.69.40
Found Yes
Hash 66b66315086cbbed1d67abbbd68dafd3a284714d9e3bf848fe90283756a09d3f
SimHash 6b1ddcf2ab10

Groups

amazonbot

Rule Path
Disallow /

googlebot

Rule Path
Disallow /nogooglebot/

*

Rule Path
Disallow /login

adsbot-google

Rule Path
Disallow /login

nutch

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /login

Other Records

Field Value
crawl-delay 10

ahrefssiteaudit

Rule Path
Disallow /login

Other Records

Field Value
crawl-delay 10

mj12bot

Rule Path
Disallow /login

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://www.readtheblueprint.com/sitemap.xml