x.dawn.com
robots.txt

Robots Exclusion Standard data for x.dawn.com

Resource Scan

Scan Details

Site Domain x.dawn.com
Base Domain dawn.com
Scan Status Ok
Last Scan2024-05-12T09:26:45+00:00
Next Scan 2024-05-19T09:26:45+00:00

Last Scan

Scanned2024-05-12T09:26:45+00:00
URL https://x.dawn.com/robots.txt
Redirect https://www.dawn.com/robots.txt
Redirect Domain www.dawn.com
Redirect Base dawn.com
Domain IPs 104.21.74.112, 172.67.157.237, 2606:4700:3032::ac43:9ded, 2606:4700:3033::6815:4a70
Redirect IPs 104.21.74.112, 172.67.157.237, 2606:4700:3032::ac43:9ded, 2606:4700:3033::6815:4a70
Response IP 104.21.74.112
Found Yes
Hash 16b0e3b64ab93b1739ad059d79e1fdb4689dae0cd3197059a989e9b9c5ae4ef3
SimHash ad169cc8caf8

Groups

*

Rule Path
Disallow */print
Disallow */authors/*/1*
Disallow */authors/*/2*
Disallow */authors/*/3*
Disallow */authors/*/4*
Disallow */authors/*/5*
Disallow */authors/*/6*
Disallow */authors/*/7*
Disallow */authors/*/8*
Disallow */authors/*/9*
Disallow /newspaper/*/20*
Disallow /archive/*

Comments

  • test tool
  • https://www.google.com/webmasters/tools/ (Crawl > Blocked URLs)