gnugk.org
robots.txt

Robots Exclusion Standard data for gnugk.org

Resource Scan

Scan Details

Site Domain gnugk.org
Base Domain gnugk.org
Scan Status Ok
Last Scan2026-01-02T11:27:02+00:00
Next Scan 2026-02-01T11:27:02+00:00

Last Scan

Scanned2026-01-02T11:27:02+00:00
URL https://gnugk.org/robots.txt
Redirect https://www.gnugk.org/robots.txt
Redirect Domain www.gnugk.org
Redirect Base gnugk.org
Domain IPs 104.21.59.52, 172.67.214.143, 2606:4700:3035::ac43:d68f, 2606:4700:3037::6815:3b34
Redirect IPs 104.21.59.52, 172.67.214.143, 2606:4700:3035::ac43:d68f, 2606:4700:3037::6815:3b34
Response IP 104.21.59.52
Found Yes
Hash b94bad1a394e2c022e8d2aa133df340cdc55ad9cb866635802dd5b0b23e8516a
SimHash 141cd3426a63

Groups

google-extended

Rule Path
Disallow /

oai-searchbot

Rule Path
Disallow /perl/
Disallow /products/
Disallow /?q=

*

Rule Path
Disallow /ext/
Disallow /sid/
Disallow /simg/