googlenewsweb.com
robots.txt

Robots Exclusion Standard data for googlenewsweb.com

Resource Scan

Scan Details

Site Domain googlenewsweb.com
Base Domain googlenewsweb.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't connect to server.
Last Scan 2025-06-20T07:14:49+00:00
Next Scan 2025-09-18T07:14:49+00:00

Last Successful Scan

Scanned 2023-11-29T05:13:49+00:00
URL https://googlenewsweb.com/robots.txt
Redirect https://www.googlenewsweb.com/robots.txt
Redirect Domain www.googlenewsweb.com
Redirect Base googlenewsweb.com
Domain IPs 216.239.32.21, 216.239.34.21, 216.239.36.21, 216.239.38.21
Redirect IPs 172.253.118.121, 2404:6800:4003:c02::79
Response IP 74.125.68.121
Found Yes
Hash 11b1efe96e6d655fa13ac73f6c53adbfe14be0321f16dd57586662c0736294a6
SimHash 4b04da604f92

Groups

mediapartners-google

Rule Path
Disallow

*

Rule Path
Disallow /search
Allow /

Other Records

Field Value
sitemap https://www.googlenewsweb.com/sitemap.xml
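
The groups and records above can be read back as a robots.txt file and checked against sample paths. The following is a minimal sketch, not the site's actual file: the text in ROBOTS_TXT is reconstructed from the listed rules (the original ordering, spacing, and any comments are unknown), the empty Disallow for mediapartners-google is taken to mean "nothing is disallowed", and the helper name allowed and the sample user agent Googlebot are illustrative assumptions. It uses Python's standard urllib.robotparser, which applies the first matching rule in a group, so the Disallow line is kept ahead of the Allow line as listed.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the Groups and Other Records sections of the scan;
# this is an assumption about the file's layout, not a verbatim copy.
ROBOTS_TXT = """\
User-agent: Mediapartners-Google
Disallow:

User-agent: *
Disallow: /search
Allow: /

Sitemap: https://www.googlenewsweb.com/sitemap.xml
"""

parser = RobotFileParser()
# parse() takes an iterable of lines, so no network request is made.
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Hypothetical helper: may `agent` fetch `path` on the redirect domain?"""
    return parser.can_fetch(agent, "https://www.googlenewsweb.com" + path)


if __name__ == "__main__":
    for agent in ("Googlebot", "Mediapartners-Google"):
        for path in ("/", "/search"):
            print(agent, path, allowed(agent, path))
    # site_maps() is available in Python 3.8 and later.
    print(parser.site_maps())
```

Under these assumptions, a generic crawler is blocked from /search but allowed everywhere else, while mediapartners-google is not restricted at all.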