mycouriertribune.com
robots.txt

Robots Exclusion Standard data for mycouriertribune.com

Resource Scan

Scan Details

Site Domain mycouriertribune.com
Base Domain mycouriertribune.com
Scan Status Ok
Last Scan 2024-11-14T02:29:01+00:00
Next Scan 2024-11-21T02:29:01+00:00

Last Scan

Scanned 2024-11-14T02:29:01+00:00
URL https://mycouriertribune.com/robots.txt
Redirect https://www.mycouriertribune.com/robots.txt
Redirect Domain www.mycouriertribune.com
Redirect Base mycouriertribune.com
Domain IPs 74.84.144.174, 74.84.144.198
Redirect IPs 74.84.144.198
Response IP 74.84.144.174
Found Yes
Hash 889d28ab84c8fb29de3ab7a15f5b64833de6f1d7e51c16416353d10615b4b744
SimHash 0818c9500135

Groups

google-extended

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

*

Rule Path
Allow /ads.txt
Disallow /ads

Comments

  • User-agent: Googlebot
  • Disallow: /

Warnings

  • 1 invalid line.
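
To see how the groups above would be applied to individual requests, the following is a minimal sketch using Python's standard urllib.robotparser. The ROBOTS_TXT text is a reconstruction from this report, not the original file: the ordering, casing, and the one invalid line are not shown in the scan, so they are assumptions. The helper name is_allowed and the agent name ExampleBot are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the groups and comments listed in this report.
# Assumption: the real file may differ in ordering and casing, and the
# one invalid line reported under Warnings is not reproduced here.
ROBOTS_TXT = """\
User-agent: google-extended
Disallow: /

User-agent: ccbot
Disallow: /

User-agent: chatgpt-user
Disallow: /

User-agent: omgilibot
Disallow: /

User-agent: *
Allow: /ads.txt
Disallow: /ads

# User-agent: Googlebot
# Disallow: /
"""

parser = RobotFileParser()
# parse() takes an iterable of lines, so no network access is needed.
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(user_agent: str, path: str) -> bool:
    """Return True if the reconstructed rules let user_agent fetch path."""
    return parser.can_fetch(user_agent, path)


if __name__ == "__main__":
    checks = [
        ("CCBot", "/news"),               # named group, everything disallowed
        ("ExampleBot", "/ads.txt"),       # falls to "*", explicitly allowed
        ("ExampleBot", "/ads/banner.js"), # falls to "*", under /ads
        ("Googlebot", "/news"),           # its group is only a comment
    ]
    for agent, path in checks:
        verdict = "allowed" if is_allowed(agent, path) else "disallowed"
        print(f"{agent:12} {path:16} {verdict}")
```

Because the Googlebot lines appear only as comments, a parser ignores them and Googlebot falls through to the * group. Note that urllib.robotparser applies the first matching rule within a group rather than the longest match; for the rules listed here both approaches give the same answers.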