harian9.com
robots.txt

Robots Exclusion Standard data for harian9.com

Resource Scan

Scan Details

Site Domain harian9.com
Base Domain harian9.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-09-17T06:37:47+00:00
Next Scan 2024-12-16T06:37:47+00:00

Last Successful Scan

Scanned 2024-05-19T20:04:27+00:00
URL https://harian9.com/robots.txt
Redirect https://www.harian9.com/robots.txt
Redirect Domain www.harian9.com
Redirect Base harian9.com
Domain IPs 104.21.14.138, 172.67.159.167, 2606:4700:3030::ac43:9fa7, 2606:4700:3036::6815:e8a
Redirect IPs 104.21.14.138, 172.67.159.167, 2606:4700:3030::ac43:9fa7, 2606:4700:3036::6815:e8a
Response IP 172.67.159.167
Found Yes
Hash 2eae4de29d7997d9466cfe83b7bde24a3ae44dd7f338df9e8a41898153ee6d75
SimHash 4835d9758171

Groups

*
googlebot

Rule Path
Allow /

chatgpt-user

Rule Path
Disallow /

openai

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /
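
The groups above can be checked with a short sketch using Python's standard urllib.robotparser. The robots.txt text below is a reconstruction from this listing, not the file as served; in particular, placing `*` and `googlebot` in one shared group is an assumption based on how the listing is laid out. The names `RECONSTRUCTED_ROBOTS_TXT`, `build_parser`, and `is_allowed` are hypothetical helpers introduced for illustration.

```python
from urllib.robotparser import RobotFileParser

# Assumption: reconstructed from the group listing in this report,
# not copied from the live file. "*" and "googlebot" are assumed to
# share a single group with "Allow: /".
RECONSTRUCTED_ROBOTS_TXT = """\
User-agent: *
User-agent: googlebot
Allow: /

User-agent: chatgpt-user
Disallow: /

User-agent: openai
Disallow: /

User-agent: ccbot
Disallow: /

User-agent: gptbot
Disallow: /
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text locally, without any network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


def is_allowed(parser: RobotFileParser, user_agent: str,
               url: str = "https://www.harian9.com/") -> bool:
    """Return whether the parsed rules let user_agent fetch url."""
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    parser = build_parser(RECONSTRUCTED_ROBOTS_TXT)
    for agent in ("Googlebot", "ChatGPT-User", "openai", "CCBot", "GPTBot"):
        print(agent, is_allowed(parser, agent))
```

Under these assumptions, the four AI-related agents are refused the site root while Googlebot and unlisted agents fall through to the `Allow /` group. This reflects the last successful scan only; the standard-library parser matches agent names case-insensitively by substring, which may differ from how a given crawler interprets the file.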