thegraftonnews.com
robots.txt

Robots Exclusion Standard data for thegraftonnews.com

Resource Scan

Scan Details

Site Domain thegraftonnews.com
Base Domain thegraftonnews.com
Scan Status Ok
Last Scan2024-11-15T21:28:30+00:00
Next Scan 2024-11-22T21:28:30+00:00

Last Scan

Scanned2024-11-15T21:28:30+00:00
URL https://thegraftonnews.com/robots.txt
Redirect https://www.thegraftonnews.com/robots.txt
Redirect Domain www.thegraftonnews.com
Redirect Base thegraftonnews.com
Domain IPs 74.84.144.174, 74.84.144.198
Redirect IPs 74.84.144.198
Response IP 74.84.144.174
Found Yes
Hash 889d28ab84c8fb29de3ab7a15f5b64833de6f1d7e51c16416353d10615b4b744
SimHash 0818c9500135

Groups

google-extended

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

*

Rule Path
Allow /ads.txt
Disallow /ads

Comments

  • User-agent: Googlebot
  • Disallow: /

Warnings

  • 1 invalid line.