cnnphilippines.com
robots.txt

Robots Exclusion Standard data for cnnphilippines.com

Resource Scan

Scan Details

Site Domain cnnphilippines.com
Base Domain cnnphilippines.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a server error.
Last Scan2024-08-26T09:24:49+00:00
Next Scan 2024-11-24T09:24:49+00:00

Last Successful Scan

Scanned2024-01-30T09:22:46+00:00
URL https://cnnphilippines.com/robots.txt
Redirect https://www.cnnphilippines.com/robots.txt
Redirect Domain www.cnnphilippines.com
Redirect Base cnnphilippines.com
Domain IPs 151.101.194.132
Redirect IPs 151.101.130.132, 151.101.194.132, 151.101.2.132, 151.101.66.132, 2a04:4e42:200::644, 2a04:4e42:400::644, 2a04:4e42:600::644, 2a04:4e42::644
Response IP 199.232.46.132
Found Yes
Hash 1c5a44df74055fdc2fb095150b70ad2393921b8f2b3ee781cb85e3fa673be7bc
SimHash c944981337d6

Groups

googlebot

Rule Path
Allow /

Other Records

Field Value
sitemap http://cnnphilippines.com/sitemap/archive/sections.xml

Comments

  • Rule 1
  • no bot may crawl
  • User-agent: *
  • Disallow: /
  • Rule 2