clgt.net
robots.txt

Robots Exclusion Standard data for clgt.net

Resource Scan

Scan Details

Site Domain	clgt.net
Base Domain	clgt.net
Scan Status	Ok
Last Scan	2024-11-16T18:08:32+00:00
Next Scan	2024-11-23T18:08:32+00:00
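
The two timestamps above suggest a weekly rescan. As a small check, the interval can be computed directly from the values reported here. This is only a sketch: the variable names are illustrative, and the weekly schedule is inferred from these two timestamps rather than stated by the scanner.

```python
from datetime import datetime, timedelta

# Timestamps copied from the Scan Details table (ISO 8601, UTC offset +00:00).
last_scan = datetime.fromisoformat("2024-11-16T18:08:32+00:00")
next_scan = datetime.fromisoformat("2024-11-23T18:08:32+00:00")

# Interval between the reported last scan and the scheduled next scan.
interval = next_scan - last_scan
print(interval.days)  # number of whole days between the two scans
```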

Last Scan

Scanned	2024-11-16T18:08:32+00:00
URL	https://clgt.net/robots.txt
Domain IPs	104.21.234.222, 104.21.234.223, 2606:4700:3038::6815:eade, 2606:4700:3038::6815:eadf
Response IP	104.21.234.222
Found	Yes
Hash	9ec959fc4a508429bcbaf0bd7e3f6087b61e3e067eb85844045e4b9517bc0423
SimHash	be4b7a464d51

Groups

teleport

Rule	Path
Disallow	/

teleportpro

Rule	Path
Disallow	/

emailcollector

Rule	Path
Disallow	/

emailsiphon

Rule	Path
Disallow	/

webbandit

Rule	Path
Disallow	/

webzip

Rule	Path
Disallow	/

webreaper

Rule	Path
Disallow	/

webstripper

Rule	Path
Disallow	/

web downloader

Rule	Path
Disallow	/

webcopier

Rule	Path
Disallow	/

offline explorer pro

Rule	Path
Disallow	/

offline explorer

Rule	Path
Disallow	/

httrack website copier

Rule	Path
Disallow	/

offline commander

Rule	Path
Disallow	/

leech

Rule	Path
Disallow	/

websnake

Rule	Path
Disallow	/

blackwidow

Rule	Path
Disallow	/

http weazel

Rule	Path
Disallow	/
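
Every group above carries the same single rule, `Disallow: /`, so each listed user agent is barred from the whole site while unlisted agents are unaffected. A minimal sketch of how these groups could be expressed as a robots.txt file and checked with Python's standard `urllib.robotparser` follows. The reconstructed text is an assumption: the scanner reports agent names in lowercase and does not show the original file, so this is not a byte-for-byte copy and would not reproduce the hash listed above. `BLOCKED_AGENTS`, `build_robots_txt`, and `examplebot` are hypothetical names used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# User-agent names exactly as listed in the Groups section of this report.
BLOCKED_AGENTS = [
    "teleport", "teleportpro", "emailcollector", "emailsiphon",
    "webbandit", "webzip", "webreaper", "webstripper",
    "web downloader", "webcopier", "offline explorer pro",
    "offline explorer", "httrack website copier", "offline commander",
    "leech", "websnake", "blackwidow", "http weazel",
]


def build_robots_txt(agents):
    """Emit one group per agent, each with a single 'Disallow: /' rule."""
    groups = [f"User-agent: {agent}\nDisallow: /" for agent in agents]
    return "\n\n".join(groups) + "\n"


# Parse the reconstructed text locally; no network request is made.
robots_txt = build_robots_txt(BLOCKED_AGENTS)
parser = RobotFileParser()
parser.parse(robots_txt.splitlines())

# A listed agent is refused everywhere; an unlisted (hypothetical) one is not.
print(parser.can_fetch("webzip", "https://clgt.net/"))
print(parser.can_fetch("examplebot", "https://clgt.net/"))
```

Because the file has no `User-agent: *` group, any crawler not named in the list falls through with no restrictions. Note that robots.txt is advisory, so these rules only affect clients that choose to honor them.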
