everythinglinux.org
robots.txt

Robots Exclusion Standard data for everythinglinux.org

Resource Scan

Scan Details

Site Domain everythinglinux.org
Base Domain everythinglinux.org
Scan Status Ok
Last Scan2025-12-15T03:38:09+00:00
Next Scan 2025-12-22T03:38:09+00:00

Last Scan

Scanned2025-12-15T03:38:09+00:00
URL https://everythinglinux.org/robots.txt
Domain IPs 172.100.214.218
Response IP 172.100.214.218
Found Yes
Hash de81fcf81e0d4806ce8f14b28e358e191a87ec1f4fb24bfb10c078e9b736f808
SimHash 6d7d9a20e333

Groups

*

Rule Path
Disallow /stats/
Disallow /lct/
Disallow /cgi-bin/
Disallow /OFFLINE/
Disallow /assets/

mj12bot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

gptbot

Rule Path
Disallow /