truthout.com
robots.txt
Robots Exclusion Standard data for truthout.com
Resource Scan
Scan Details
Site Domain | truthout.com |
Base Domain | truthout.com |
Scan Status | Ok |
Last Scan | 2025-08-25T04:46:00+00:00 |
Next Scan | 2025-09-24T04:46:00+00:00 |
Last Scan
Scanned | 2025-08-25T04:46:00+00:00 |
URL | https://truthout.com/robots.txt |
Redirect | https://truthout.org/robots.txt |
Redirect Domain | truthout.org |
Redirect Base | truthout.org |
Domain IPs | 104.21.18.231, 172.67.183.224, 2606:4700:3030::6815:12e7, 2606:4700:3034::ac43:b7e0 |
Redirect IPs | 172.66.135.25, 172.66.139.249, 2606:4700:10::ac42:8719, 2606:4700:10::ac42:8bf9 |
Response IP | 172.66.139.249 |
Found | Yes |
Hash | 8600bdc2f58f7291444635a7cf02cdeb316515e3110463417183ef7683e0d0a3 |
SimHash | 41a4cf40d4c0 |
Groups
*
Rule | Path |
---|---|
Disallow | /wp-admin/ |
Disallow | /cdn-cgi/ |
Allow | /wp-admin/admin-ajax.php |
Other Records
Field | Value |
---|---|
crawl-delay | 10 |
googlebot
Rule | Path |
---|---|
Disallow | /wp-admin/ |
Disallow | /cdn-cgi/ |
Allow | /wp-admin/admin-ajax.php |
Other Records
Field | Value |
---|---|
crawl-delay | 0 |
Other Records
Field | Value |
---|---|
sitemap | https://truthout.org/sitemap_index.xml |
sitemap | https://truthout.org/sitemap_index.xml |
sitemap | https://truthout.org/news-sitemap.xml |
sitemap | https://truthout.org/news-sitemap.xml |
Comments