arhivach.hk
robots.txt

Robots Exclusion Standard data for arhivach.hk

Resource Scan

Scan Details

Site Domain arhivach.hk
Base Domain arhivach.hk
Scan Status Ok
Last Scan 2025-03-09T23:05:51+00:00
Next Scan 2025-03-16T23:05:51+00:00

Last Scan

Scanned 2025-03-09T23:05:51+00:00
URL https://arhivach.hk/robots.txt
Domain IPs 104.21.75.45, 172.67.213.193, 2606:4700:3030::6815:4b2d, 2606:4700:3033::ac43:d5c1
Response IP 104.21.75.45
Found Yes
Hash 044196942e0baff5f0208e3354a0771d8c6a7957e62dc51ae534b1fdfda58ce2
SimHash 3b0f7890d7d8

Groups

yandexbot

Rule Path
Disallow /index/
Disallow /storage/*.webm
Disallow /storage/*.mp4
Disallow /storage2/*.webm
Disallow /storage2/*.mp4
Disallow /storage3/*.webm
Disallow /storage3/*.mp4

bingbot

Rule Path
Disallow /storage/*.webm
Disallow /storage/*.mp4
Disallow /storage2/*.webm
Disallow /storage2/*.mp4
Disallow /storage3/*.webm
Disallow /storage3/*.mp4

Other Records

Field Value
crawl-delay 1
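
The yandexbot and bingbot rules above use the `*` wildcard. As a rough illustration of how such paths are commonly matched (wildcard matches any run of characters, rules match from the start of the URL path), here is a minimal Python sketch. The helper names and the sample paths are hypothetical and not taken from the scan; real crawlers may differ in details such as percent-encoding and rule precedence.

```python
import re
from typing import Iterable

def robots_pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the
    end of the path. Everything else is matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_disallowed(path: str, disallow_patterns: Iterable[str]) -> bool:
    """Return True if any Disallow pattern matches the start of the path."""
    return any(robots_pattern_to_regex(p).match(path) for p in disallow_patterns)

# Disallow rules as listed in the bingbot group of this scan.
BINGBOT_DISALLOW = [
    "/storage/*.webm",
    "/storage/*.mp4",
    "/storage2/*.webm",
    "/storage2/*.mp4",
    "/storage3/*.webm",
    "/storage3/*.mp4",
]

# The yandexbot group has the same rules plus /index/.
YANDEXBOT_DISALLOW = ["/index/"] + BINGBOT_DISALLOW

if __name__ == "__main__":
    # Sample paths are made up for illustration only.
    for sample in ("/storage/a/b/clip.webm", "/storage/a/pic.png", "/index/1/"):
        print(sample, is_disallowed(sample, YANDEXBOT_DISALLOW))
```

Under these assumptions, video files under the three storage prefixes are blocked for both crawlers, while other file types in the same directories stay crawlable.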

mj12bot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

semrushbot-ba

Rule Path
Disallow /

sentibot

Rule Path
Disallow /

cliqzbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

Other Records

Field Value
sitemap https://arhivach.hk/sitemap.xml
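
The full-site blocks, the crawl-delay value and the sitemap record above can be checked with Python's standard `urllib.robotparser`. The sketch below is an illustration only: the robots.txt text is a partial reconstruction from this report (the original file, including its 11 invalid lines, is not reproduced here), and the `/thread/1/` path and the `googlebot` agent are hypothetical. The wildcard rules are left out because `urllib.robotparser` matches plain path prefixes and does not interpret `*`.

```python
from urllib.robotparser import RobotFileParser

# Partial reconstruction from the groups listed in this report.
# This is an assumption, not the site's actual robots.txt.
ROBOTS_TXT = """\
User-agent: bingbot
Crawl-delay: 1

User-agent: mj12bot
Disallow: /

User-agent: ahrefsbot
Disallow: /

Sitemap: https://arhivach.hk/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# Hypothetical page URL used only to exercise the rules.
page = "https://arhivach.hk/thread/1/"

blocked_for_mj12bot = not parser.can_fetch("mj12bot", page)
allowed_for_unlisted = parser.can_fetch("googlebot", page)  # no matching group
bingbot_delay = parser.crawl_delay("bingbot")
sitemaps = parser.site_maps()  # available in Python 3.8+

if __name__ == "__main__":
    print(blocked_for_mj12bot, allowed_for_unlisted, bingbot_delay, sitemaps)
```

Because the listed groups name specific crawlers and the report shows no `*` group, an agent that matches none of them is treated as allowed in this sketch.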

Warnings

  • 11 invalid lines.