twentyfournews.com
robots.txt

Robots Exclusion Standard data for twentyfournews.com

Resource Scan

Scan Details

Site Domain twentyfournews.com
Base Domain twentyfournews.com
Scan Status Ok
Last Scan2024-09-23T21:59:30+00:00
Next Scan 2024-09-30T21:59:30+00:00

Last Scan

Scanned2024-09-23T21:59:30+00:00
URL https://www.twentyfournews.com/robots.txt
Domain IPs 18.155.68.128, 18.155.68.39, 18.155.68.57, 18.155.68.73, 2600:9000:23d2:4000:9:eeec:d700:93a1, 2600:9000:23d2:6600:9:eeec:d700:93a1, 2600:9000:23d2:8c00:9:eeec:d700:93a1, 2600:9000:23d2:d600:9:eeec:d700:93a1, 2600:9000:23d2:e400:9:eeec:d700:93a1, 2600:9000:23d2:e600:9:eeec:d700:93a1, 2600:9000:23d2:e800:9:eeec:d700:93a1, 2600:9000:23d2:f200:9:eeec:d700:93a1
Response IP 18.155.68.73
Found Yes
Hash 756a5424c12bd6302f5b986390552b6559b45b75acf4ad132c6a57dc4246b68b
SimHash 2e3c3890288b

Groups

googlebot

Rule Path
Disallow /2016/
Disallow /cache/
Disallow /2017/
Disallow /2018/
Disallow /2019/
Disallow /2020/
Disallow /2021/
Disallow /2022/
Disallow /page/
Disallow /plugins/
Disallow /?p=*
Disallow /tmp/
Disallow /xmlrpc/
Disallow /wp-admin/

*

Rule Path
Disallow /wp-content/cache/

Other Records

Field Value
sitemap http://twentyfournews.com/sitemap_index.xml

Comments

  • BEGIN W3TC ROBOTS
  • END W3TC ROBOTS