trccg.org
robots.txt

Robots Exclusion Standard data for trccg.org

Resource Scan

Scan Details

Site Domain trccg.org
Base Domain trccg.org
Scan Status Ok
Last Scan2024-05-16T08:56:21+00:00
Next Scan 2024-06-15T08:56:21+00:00

Last Scan

Scanned2024-05-16T08:56:21+00:00
URL https://trccg.org/robots.txt
Domain IPs 104.26.2.237, 104.26.3.237, 172.67.69.173, 2606:4700:20::681a:2ed, 2606:4700:20::681a:3ed, 2606:4700:20::ac43:45ad
Response IP 104.26.2.237
Found Yes
Hash aa74e18cf493546204c53e367936b9547cf11fdd2a61242fde3b06d9dd765e7e
SimHash 9e4b70464d51

Groups

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

emailcollector

Rule Path
Disallow /

emailsiphon

Rule Path
Disallow /

webbandit

Rule Path
Disallow /

webzip

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

web downloader

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

offline explorer pro

Rule Path
Disallow /

httrack website copier

Rule Path
Disallow /

offline commander

Rule Path
Disallow /

leech

Rule Path
Disallow /

websnake

Rule Path
Disallow /

blackwidow

Rule Path
Disallow /

http weazel

Rule Path
Disallow /