dgruien.com
robots.txt

Robots Exclusion Standard data for dgruien.com

Resource Scan

Scan Details

Site Domain dgruien.com
Base Domain dgruien.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2025-12-19T22:07:18+00:00
Next Scan 2026-03-19T22:07:18+00:00

Last Successful Scan

Scanned2025-07-30T04:21:51+00:00
URL https://www.dgruien.com/robots.txt
Domain IPs 104.21.61.175, 172.67.212.151, 2606:4700:3034::6815:3daf, 2606:4700:3035::ac43:d497
Response IP 172.67.212.151
Found Yes
Hash 9da5063e1be9bde7a9435f3cde4ad33b75554367fac991a78ce46c349e3e6a32
SimHash a24f50427d11

Groups

*

Rule Path
Allow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

emailcollector

Rule Path
Disallow /

emailsiphon

Rule Path
Disallow /

webbandit

Rule Path
Disallow /

webzip

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

web downloader

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

offline explorer pro

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

httrack website copier

Rule Path
Disallow /

offline commander

Rule Path
Disallow /

leech

Rule Path
Disallow /

websnake

Rule Path
Disallow /

blackwidow

Rule Path
Disallow /

http weazel

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.dgruien.com/sitemap.xml