trccg.org
robots.txt

Robots Exclusion Standard data for trccg.org

Resource Scan

Scan Details

Site Domain	trccg.org
Base Domain	trccg.org
Scan Status	Ok
Last Scan	2024-05-16T08:56:21+00:00
Next Scan	2024-06-15T08:56:21+00:00
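
The two timestamps above imply a rescan interval, which the report does not state as a policy. A minimal sketch of the arithmetic, with the 30-day figure taken only from these two values:

```python
from datetime import datetime, timedelta

# Timestamps copied from the Scan Details table above.
last_scan = datetime.fromisoformat("2024-05-16T08:56:21+00:00")
next_scan = datetime.fromisoformat("2024-06-15T08:56:21+00:00")

# Difference between the scheduled next scan and the last scan.
interval = next_scan - last_scan
print(interval.days)  # 30
```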

Last Scan

Scanned	2024-05-16T08:56:21+00:00
URL	https://trccg.org/robots.txt
Domain IPs	104.26.2.237, 104.26.3.237, 172.67.69.173, 2606:4700:20::681a:2ed, 2606:4700:20::681a:3ed, 2606:4700:20::ac43:45ad
Response IP	104.26.2.237
Found	Yes
Hash	aa74e18cf493546204c53e367936b9547cf11fdd2a61242fde3b06d9dd765e7e
SimHash	9e4b70464d51
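
The report does not name the algorithm behind the Hash field. Its 64 hexadecimal digits are consistent with a 256-bit digest such as SHA-256, so the sketch below assumes SHA-256 over the raw response body. The helper names `sha256_hex`, `matches_report`, and `fetch_body` are hypothetical, and a mismatch could equally mean that the file has changed since the scan or that the scanner hashes a normalized form of the content.

```python
import hashlib
import urllib.request

# Hash value copied from the Last Scan table above.
REPORTED_HASH = "aa74e18cf493546204c53e367936b9547cf11fdd2a61242fde3b06d9dd765e7e"
ROBOTS_URL = "https://trccg.org/robots.txt"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of body as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_report(body: bytes) -> bool:
    """Compare a body's digest with the reported hash (assumes SHA-256)."""
    return sha256_hex(body) == REPORTED_HASH


def fetch_body(url: str = ROBOTS_URL) -> bytes:
    """Download the current robots.txt; not called automatically."""
    with urllib.request.urlopen(url, timeout=10) as response:
        return response.read()
```

Calling `matches_report(fetch_body())` would test the assumption against the live file.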

Groups

teleport

Rule	Path
Disallow	/

teleportpro

Rule	Path
Disallow	/

emailcollector

Rule	Path
Disallow	/

emailsiphon

Rule	Path
Disallow	/

webbandit

Rule	Path
Disallow	/

webzip

Rule	Path
Disallow	/

webreaper

Rule	Path
Disallow	/

webstripper

Rule	Path
Disallow	/

web downloader

Rule	Path
Disallow	/

webcopier

Rule	Path
Disallow	/

offline explorer pro

Rule	Path
Disallow	/

httrack website copier

Rule	Path
Disallow	/

offline commander

Rule	Path
Disallow	/

leech

Rule	Path
Disallow	/

websnake

Rule	Path
Disallow	/

blackwidow

Rule	Path
Disallow	/

http weazel

Rule	Path
Disallow	/
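
Each group above pairs one user agent with a single `Disallow: /` rule, which blocks that agent from the whole site. The sketch below rebuilds an equivalent robots.txt from the group names and checks it with Python's standard `urllib.robotparser`. It is a reconstruction under assumptions: the names are used in the lowercased form the report shows, the original file's casing and comments are unknown, and `ExampleBot` is a hypothetical agent that appears in no group.

```python
from urllib.robotparser import RobotFileParser

# User-agent groups as listed in the scan report (lowercased there).
BLOCKED_AGENTS = [
    "teleport", "teleportpro", "emailcollector", "emailsiphon",
    "webbandit", "webzip", "webreaper", "webstripper",
    "web downloader", "webcopier", "offline explorer pro",
    "httrack website copier", "offline commander", "leech",
    "websnake", "blackwidow", "http weazel",
]


def build_robots_txt(agents):
    """Render one 'User-agent' group with 'Disallow: /' per agent."""
    groups = [f"User-agent: {agent}\nDisallow: /\n" for agent in agents]
    return "\n".join(groups)


def make_parser(agents):
    """Parse the reconstructed rules without any network access."""
    parser = RobotFileParser()
    parser.parse(build_robots_txt(agents).splitlines())
    return parser


parser = make_parser(BLOCKED_AGENTS)
print(parser.can_fetch("webzip", "https://trccg.org/"))      # False
print(parser.can_fetch("ExampleBot", "https://trccg.org/"))  # True
```

Because the file has no `User-agent: *` group, any agent not named in a group is allowed by default. Compliance is voluntary: these rules only affect clients that read and honor robots.txt.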
