dgruien.com
robots.txt

Robots Exclusion Standard data for dgruien.com

Resource Scan

Scan Details

Site Domain	dgruien.com
Base Domain	dgruien.com
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Couldn't connect to server.
Last Scan	2025-12-19T22:07:18+00:00
Next Scan	2026-03-19T22:07:18+00:00

Last Successful Scan

Scanned	2025-07-30T04:21:51+00:00
URL	https://www.dgruien.com/robots.txt
Domain IPs	104.21.61.175, 172.67.212.151, 2606:4700:3034::6815:3daf, 2606:4700:3035::ac43:d497
Response IP	172.67.212.151
Found	Yes
Hash	9da5063e1be9bde7a9435f3cde4ad33b75554367fac991a78ce46c349e3e6a32
SimHash	a24f50427d11

Groups

*

Rule	Path
Allow	/

teleport

Rule	Path
Disallow	/

teleportpro

Rule	Path
Disallow	/

emailcollector

Rule	Path
Disallow	/

emailsiphon

Rule	Path
Disallow	/

webbandit

Rule	Path
Disallow	/

webzip

Rule	Path
Disallow	/

webreaper

Rule	Path
Disallow	/

webstripper

Rule	Path
Disallow	/

web downloader

Rule	Path
Disallow	/

ahrefsbot

Rule	Path
Disallow	/

semrushbot

Rule	Path
Disallow	/

mj12bot

Rule	Path
Disallow	/

webcopier

Rule	Path
Disallow	/

offline explorer pro

Rule	Path
Disallow	/

offline explorer

Rule	Path
Disallow	/

httrack website copier

Rule	Path
Disallow	/

offline commander

Rule	Path
Disallow	/

leech

Rule	Path
Disallow	/

websnake

Rule	Path
Disallow	/

blackwidow

Rule	Path
Disallow	/

http weazel

Rule	Path
Disallow	/

Other Records

Field	Value
sitemap	https://www.dgruien.com/sitemap.xml
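
The groups and sitemap record listed above can be checked programmatically. The following is a minimal sketch, assuming the live file used the conventional `User-agent` / `Allow` / `Disallow` / `Sitemap` syntax; the text it builds is a reconstruction from this report, not the archived file itself (the scan shows user-agent tokens lowercased, so the original casing is unknown). The helper names `build_robots_txt` and `load_parser` and the agent name `ExampleBot` are hypothetical. It uses Python's standard `urllib.robotparser`, which matches user-agent tokens case-insensitively by substring.

```python
from urllib.robotparser import RobotFileParser

SITE = "https://www.dgruien.com"

# User-agent tokens that the scan lists with "Disallow: /".
# Tokens are shown as reported (lowercased); original casing is unknown.
BLOCKED_AGENTS = [
    "teleport", "teleportpro", "emailcollector", "emailsiphon",
    "webbandit", "webzip", "webreaper", "webstripper", "web downloader",
    "ahrefsbot", "semrushbot", "mj12bot", "webcopier",
    "offline explorer pro", "offline explorer", "httrack website copier",
    "offline commander", "leech", "websnake", "blackwidow", "http weazel",
]


def build_robots_txt() -> str:
    """Rebuild an equivalent robots.txt from the groups in the report."""
    parts = ["User-agent: *", "Allow: /", ""]
    for agent in BLOCKED_AGENTS:
        parts += [f"User-agent: {agent}", "Disallow: /", ""]
    parts.append(f"Sitemap: {SITE}/sitemap.xml")
    return "\n".join(parts) + "\n"


def load_parser() -> RobotFileParser:
    """Parse the reconstructed text without making any network request."""
    parser = RobotFileParser()
    parser.parse(build_robots_txt().splitlines())
    return parser


if __name__ == "__main__":
    parser = load_parser()
    # "ExampleBot" is a hypothetical crawler that matches no blocked group.
    for agent in ("ExampleBot", "AhrefsBot", "teleportpro"):
        allowed = parser.can_fetch(agent, f"{SITE}/")
        print(f"{agent}: {'allowed' if allowed else 'blocked'}")
    print(parser.site_maps())  # site_maps() requires Python 3.8+
```

Because the `*` group allows `/`, any crawler not named in a blocking group falls through to the default and is allowed; the named crawlers are blocked from the whole site.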
