theharrison-press.com
robots.txt

Robots Exclusion Standard data for theharrison-press.com

Resource Scan

Scan Details

Site Domain theharrison-press.com
Base Domain theharrison-press.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't connect to server.
Last Scan 2024-04-03T19:18:33+00:00
Next Scan 2024-07-02T19:18:33+00:00
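
The Last Scan and Next Scan timestamps are exactly 90 days apart. The report does not state a rescan policy, so the fixed 90-day interval in the sketch below is an inference from these two values only:

```python
from datetime import datetime, timedelta

# Timestamps copied from the Scan Details in this report.
last_scan = datetime.fromisoformat("2024-04-03T19:18:33+00:00")
next_scan = datetime.fromisoformat("2024-07-02T19:18:33+00:00")

# Gap between the failed scan and the next scheduled one.
interval = next_scan - last_scan
print(interval.days)  # 90

# Assumption: the scanner reschedules at a fixed 90-day interval.
assert last_scan + timedelta(days=90) == next_scan
```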

Last Successful Scan

Scanned 2023-12-06T19:08:18+00:00
URL http://theharrison-press.com/robots.txt
Redirect https://www.registerpublications.com/robots.txt
Redirect Domain www.registerpublications.com
Redirect Base registerpublications.com
Domain IPs 104.196.37.2
Redirect IPs 74.84.144.174
Response IP 74.84.144.174
Found Yes
Hash 889d28ab84c8fb29de3ab7a15f5b64833de6f1d7e51c16416353d10615b4b744
SimHash 0818c9500135
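
The Hash value is 64 hexadecimal characters long, which is consistent with a SHA-256 digest of the fetched file, although the report does not name the algorithm. A minimal sketch of such a content hash, assuming SHA-256 over the raw response bytes (the function name `content_hash` and the sample body are hypothetical):

```python
import hashlib

def content_hash(body: bytes) -> str:
    """Return the hex SHA-256 digest of a robots.txt response body."""
    return hashlib.sha256(body).hexdigest()

# Hypothetical body for illustration only; this is not the scanned file,
# so the digest will not match the Hash value in the report.
sample = b"User-agent: *\nDisallow: /ads\n"
digest = content_hash(sample)
print(len(digest))  # 64, the same length as the Hash value in the report
```

Comparing this digest between scans would show whether the file changed at all. The SimHash field presumably serves a different purpose (detecting near-duplicate content), and it is not reproduced here.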

Groups

google-extended

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

*

Rule Path
Allow /ads.txt
Disallow /ads
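
The groups above can be checked with Python's standard `urllib.robotparser`. The robots.txt text below is a reconstruction from the Groups and Comments sections of this report, not the original file: the original casing, the line order, and the one invalid line flagged under Warnings are unknown and are not reproduced. The helper `allowed` and the sample paths are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from this report; an approximation of the scanned file.
# The Googlebot lines appear under Comments, so they are commented out here.
ROBOTS_TXT = """\
# User-agent: Googlebot
# Disallow: /

User-agent: google-extended
Disallow: /

User-agent: ccbot
Disallow: /

User-agent: chatgpt-user
Disallow: /

User-agent: omgilibot
Disallow: /

User-agent: *
Allow: /ads.txt
Disallow: /ads
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# Host taken from the Redirect field of the last successful scan.
BASE = "https://www.registerpublications.com"

def allowed(agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` under the reconstructed rules."""
    return parser.can_fetch(agent, BASE + path)

print(allowed("ccbot", "/"))               # False: blocked site-wide
print(allowed("google-extended", "/"))     # False: blocked site-wide
print(allowed("*", "/ads.txt"))            # True: explicit Allow
print(allowed("*", "/ads/banner.js"))      # False: matches Disallow /ads
print(allowed("Googlebot", "/news/"))      # True: its rule is only a comment
```

Python's parser applies the first matching rule in file order, while RFC 9309 specifies that the longest matching path wins. Both approaches agree for this group, because `Allow /ads.txt` is listed first and is also the longer match. Note that `Disallow /ads` is a prefix rule, so it also covers any other path that begins with `/ads`.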

Comments

  • User-agent: Googlebot
  • Disallow: /

Warnings

  • 1 invalid line.