linux.org
robots.txt
Robots Exclusion Standard data for linux.org
Resource Scan
Scan Details
| Site Domain | linux.org |
| Base Domain | linux.org |
| Scan Status | Failed |
| Failure Stage | Fetching resource. |
| Failure Reason | Server returned a client error. |
| Last Scan | 2025-10-24T21:16:49+00:00 |
| Next Scan | 2026-01-22T21:16:49+00:00 |
Last Successful Scan
| Scanned | 2025-03-29T06:33:40+00:00 |
| URL | https://linux.org/robots.txt |
| Domain IPs | 104.26.14.72, 104.26.15.72, 172.67.73.26, 2606:4700:20::681a:e48, 2606:4700:20::681a:f48, 2606:4700:20::ac43:491a |
| Response IP | 104.26.14.72 |
| Found | Yes |
| Hash | 301ce083e87b349ddba1ab378f46b7eff47019d29aca9486113ce679fbb2e4a2 |
| SimHash | 005f4c409d95 |
Groups
*
| Rule | Path |
|---|---|
| Disallow | /admin.php |
| Disallow | /internal_data/ |
| Disallow | /library/ |
| Disallow | /install/ |
| Disallow | /data/ |
| Disallow | /account/ |
| Disallow | /posts/ |
| Disallow | /search/ |
| Disallow | /members/ |
| Disallow | /login/ |
| Disallow | /register/ |
| Disallow | /conversations/ |
| Disallow | /attachments/ |
| Allow | / |
Other Records
| Field | Value |
|---|---|
| sitemap | https://www.linux.org/sitemap.xml |