blognestharbor.com
robots.txt

Robots Exclusion Standard data for blognestharbor.com

Resource Scan

Scan Details

Site Domain blognestharbor.com
Base Domain blognestharbor.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a server error.
Last Scan2026-02-17T23:44:23+00:00
Next Scan 2026-03-19T23:44:23+00:00

Last Successful Scan

Scanned2026-01-19T23:26:54+00:00
URL https://blognestharbor.com/robots.txt
Domain IPs 104.21.11.170, 172.67.166.175, 2606:4700:3031::6815:baa, 2606:4700:3032::ac43:a6af
Response IP 172.67.166.175
Found Yes
Hash da8503485e0aa1d681c3818ac4aefa1a3e8d3bd05be33220faf1249ade24f5af
SimHash 50154951efd4

Groups

*

Rule Path
Disallow /admin/
Disallow /

googlebot

Rule Path
Disallow
Allow /

bingbot

Rule Path
Disallow
Allow /

applebot

Rule Path
Disallow
Allow /

yandexbot

Rule Path
Disallow
Allow /

petalbot

Rule Path
Disallow
Allow /

ahrefsbot

Rule Path
Disallow
Allow /

semrushbot

Rule Path
Disallow
Allow /

mj12bot

Rule Path
Disallow
Allow /

dotbot

Rule Path
Disallow
Allow /

rogerbot

Rule Path
Disallow
Allow /

ccbot

Rule Path
Disallow
Allow /

dataforseobot

Rule Path
Disallow
Allow /

serpstatbot

Rule Path
Disallow
Allow /

seokicks

Rule Path
Disallow
Allow /

bytespider

Rule Path
Disallow
Allow /

Other Records

Field Value
sitemap https://blognestharbor.com/sitemap.xml

Comments

  • Global defaults — block all except the whitelisted bots
  • Allow specific, trusted bots