w3newslive.com
robots.txt

Robots Exclusion Standard data for w3newslive.com

Resource Scan

Scan Details

Site Domain w3newslive.com
Base Domain w3newslive.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2025-04-27T14:55:16+00:00
Next Scan 2025-07-26T14:55:16+00:00

Last Successful Scan

Scanned 2023-04-15T14:50:42+00:00
URL https://w3newslive.com/robots.txt
Redirect https://www.newspapersland.com/robots.txt
Redirect Domain www.newspapersland.com
Redirect Base newspapersland.com
Domain IPs 104.21.70.26, 172.67.218.173, 2606:4700:3033::6815:461a, 2606:4700:3036::ac43:daad
Redirect IPs 104.21.29.63, 172.67.148.132, 2606:4700:3030::6815:1d3f, 2606:4700:3035::ac43:9484
Response IP 104.21.29.63
Found Yes
Hash e961cfb2ff51a1210112c0df72abebec87e4a3a546193bf8974c88e743533d99
SimHash 7910c8404792

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /readme.html
Disallow /refer/
Disallow /trackback/
Disallow /cgi-bin/
Disallow /android/
Allow /wp-admin/admin-ajax.php
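
The Allow rule for /wp-admin/admin-ajax.php overlaps the Disallow rule for /wp-admin/, so the outcome depends on rule precedence. The sketch below assumes the longest-match precedence described in RFC 9309 (the most specific rule wins, and Allow wins a tie). The function name is_allowed is hypothetical, and the sketch ignores the * and $ wildcards and percent-encoding because none of the listed rules use them.

```python
# Rules copied from the "*" group listed above: (directive, path prefix).
RULES = [
    ("disallow", "/wp-admin/"),
    ("disallow", "/readme.html"),
    ("disallow", "/refer/"),
    ("disallow", "/trackback/"),
    ("disallow", "/cgi-bin/"),
    ("disallow", "/android/"),
    ("allow", "/wp-admin/admin-ajax.php"),
]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence.

    Hypothetical helper: plain prefix matching only, no wildcard support.
    """
    best_len = -1
    allowed = True  # a path matched by no rule is allowed
    for directive, prefix in rules:
        if not path.startswith(prefix):
            continue
        longer = len(prefix) > best_len
        tie_for_allow = len(prefix) == best_len and directive == "allow"
        if longer or tie_for_allow:
            best_len = len(prefix)
            allowed = directive == "allow"
    return allowed


for sample in ("/wp-admin/admin-ajax.php", "/wp-admin/options.php", "/news/today"):
    print(sample, is_allowed(sample))
```

Under this assumption the admin-ajax.php endpoint stays crawlable while the rest of /wp-admin/ is blocked. Parsers that apply the first matching rule in file order instead would reach a different result for that path.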

Other Records

Field Value
sitemap https://www.newspapersland.com/sitemap.xml
sitemap https://www.newspapersland.com/page-sitemap.xml
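
The group and sitemap records above can be written back into robots.txt syntax and read with Python's standard urllib.robotparser module. This is a sketch only: the file text below is reconstructed from the listed records and is an assumption about the layout, not the original bytes, so it would not be expected to reproduce the hash reported above. The site_maps() method requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the records listed above (assumed layout, not the original file).
ROBOTS_TXT = """\
User-agent: *
Disallow: /wp-admin/
Disallow: /readme.html
Disallow: /refer/
Disallow: /trackback/
Disallow: /cgi-bin/
Disallow: /android/
Allow: /wp-admin/admin-ajax.php

Sitemap: https://www.newspapersland.com/sitemap.xml
Sitemap: https://www.newspapersland.com/page-sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# site_maps() returns the Sitemap values in file order, or None if there are none.
sitemaps = parser.site_maps()
print(sitemaps)
```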