linux.org
robots.txt

Robots Exclusion Standard data for linux.org

Resource Scan

Scan Details

Site Domain linux.org
Base Domain linux.org
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-10-24T21:16:49+00:00
Next Scan 2026-01-22T21:16:49+00:00

Last Successful Scan

Scanned2025-03-29T06:33:40+00:00
URL https://linux.org/robots.txt
Domain IPs 104.26.14.72, 104.26.15.72, 172.67.73.26, 2606:4700:20::681a:e48, 2606:4700:20::681a:f48, 2606:4700:20::ac43:491a
Response IP 104.26.14.72
Found Yes
Hash 301ce083e87b349ddba1ab378f46b7eff47019d29aca9486113ce679fbb2e4a2
SimHash 005f4c409d95

Groups

*

Rule Path
Disallow /admin.php
Disallow /internal_data/
Disallow /library/
Disallow /install/
Disallow /data/
Disallow /account/
Disallow /posts/
Disallow /search/
Disallow /members/
Disallow /login/
Disallow /register/
Disallow /conversations/
Disallow /attachments/
Allow /

Other Records

Field Value
sitemap https://www.linux.org/sitemap.xml