linuxlearn.org
robots.txt
Robots Exclusion Standard data for linuxlearn.org
Resource Scan
Scan Details
| Site Domain | linuxlearn.org |
| Base Domain | linuxlearn.org |
| Scan Status | Ok |
| Last Scan | 2026-01-04T03:37:00+00:00 |
| Next Scan | 2026-02-03T03:37:00+00:00 |
Last Scan
| Scanned | 2026-01-04T03:37:00+00:00 |
| URL | https://linuxlearn.org/robots.txt |
| Domain IPs | 104.21.50.34, 172.67.199.246, 2606:4700:3031::ac43:c7f6, 2606:4700:3037::6815:3222 |
| Response IP | 172.67.199.246 |
| Found | Yes |
| Hash | 732eb9b239273eb43a9f4fc397cf463f335746b1a5172896f3b7ffa58576eb9b |
| SimHash | 794188406d9f |
Groups
*
| Rule | Path |
|---|---|
| Disallow | /wp-content/uploads/wc-logs/ |
| Disallow | /wp-content/uploads/woocommerce_transient_files/ |
| Disallow | /wp-content/uploads/woocommerce_uploads/ |
| Disallow | /wp-admin/ |
| Allow | /wp-admin/admin-ajax.php |
*
| Rule | Path |
|---|---|
| Disallow | /wp-content/uploads/wpforms/ |
Other Records
| Field | Value |
|---|---|
| sitemap | https://linuxlearn.org/wp-sitemap.xml |
Comments