o-ha.news
robots.txt

Robots Exclusion Standard data for o-ha.news

Resource Scan

Scan Details

Site Domain o-ha.news
Base Domain o-ha.news
Scan Status Ok
Last Scan2025-03-12T04:00:53+00:00
Next Scan 2025-03-19T04:00:53+00:00

Last Scan

Scanned2025-03-12T04:00:53+00:00
URL https://o-ha.news/robots.txt
Domain IPs 52.178.90.230
Response IP 52.178.90.230
Found Yes
Hash b0c6010eb5e6f99888d48947d3ef06cebc45bf60c65cb3d1563281e6c19854a7
SimHash 6a28dac081b5

Groups

*

Rule Path
Disallow /click/
Disallow /content/pdf/
Disallow /app/

googlebot-news
twitterbot

Rule Path
Allow *

facebookexternalhit

Rule Path
Allow *

yandex

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 60

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /