w3newslive.com
robots.txt
Robots Exclusion Standard data for w3newslive.com
Resource Scan
Scan Details
Site Domain | w3newslive.com |
Base Domain | w3newslive.com |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error. |
Last Scan | 2025-04-27T14:55:16+00:00 |
Next Scan | 2025-07-26T14:55:16+00:00 |
Last Successful Scan
Scanned | 2023-04-15T14:50:42+00:00 |
URL | https://w3newslive.com/robots.txt |
Redirect | https://www.newspapersland.com/robots.txt |
Redirect Domain | www.newspapersland.com |
Redirect Base | newspapersland.com |
Domain IPs | 104.21.70.26, 172.67.218.173, 2606:4700:3033::6815:461a, 2606:4700:3036::ac43:daad |
Redirect IPs | 104.21.29.63, 172.67.148.132, 2606:4700:3030::6815:1d3f, 2606:4700:3035::ac43:9484 |
Response IP | 104.21.29.63 |
Found | Yes |
Hash | e961cfb2ff51a1210112c0df72abebec87e4a3a546193bf8974c88e743533d99 |
SimHash | 7910c8404792 |
Groups
*
Rule | Path |
---|---|
Disallow | /wp-admin/ |
Disallow | /readme.html |
Disallow | /refer/ |
Disallow | /trackback/ |
Disallow | /cgi-bin/ |
Disallow | /android/ |
Allow | /wp-admin/admin-ajax.php |
Other Records
Field | Value |
---|---|
sitemap | https://www.newspapersland.com/sitemap.xml |
sitemap | https://www.newspapersland.com/page-sitemap.xml |