cnn.ph
robots.txt

Robots Exclusion Standard data for cnn.ph

Resource Scan

Scan Details

Site Domain cnn.ph
Base Domain cnn.ph
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2024-04-07T21:45:29+00:00
Next Scan 2024-07-06T21:45:29+00:00

Last Successful Scan

Scanned2023-03-15T20:00:47+00:00
URL https://cnn.ph/robots.txt
Redirect https://www.cnnphilippines.com:443/robots.txt
Redirect Domain www.cnnphilippines.com
Redirect Base cnnphilippines.com
Domain IPs 13.229.9.90, 18.136.152.191
Redirect IPs 151.101.130.132, 151.101.194.132, 151.101.2.132, 151.101.66.132, 2a04:4e42:200::644, 2a04:4e42:400::644, 2a04:4e42:600::644, 2a04:4e42::644
Response IP 199.232.46.132
Found Yes
Hash 1c5a44df74055fdc2fb095150b70ad2393921b8f2b3ee781cb85e3fa673be7bc
SimHash c944981337d6

Groups

googlebot

Rule Path
Allow /

Other Records

Field Value
sitemap http://cnnphilippines.com/sitemap/archive/sections.xml

Comments

  • Rule 1
  • no bot may crawl
  • User-agent: *
  • Disallow: /
  • Rule 2