arhivach.top
robots.txt

Robots Exclusion Standard data for arhivach.top

Resource Scan

Scan Details

Site Domain arhivach.top
Base Domain arhivach.top
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2024-11-04T15:17:49+00:00
Next Scan 2025-01-03T15:17:49+00:00

Last Successful Scan

Scanned2024-09-06T15:13:41+00:00
URL https://arhivach.top/robots.txt
Domain IPs 104.21.22.60, 172.67.202.252, 2606:4700:3031::6815:163c, 2606:4700:3033::ac43:cafc
Response IP 104.21.22.60
Found Yes
Hash f3dcff84e251e67ba3673523ca4ff1e9609d2d08da48cbddd8f005c2a6fbe52e
SimHash 2b0f7890ded8

Groups

yandexbot

Rule Path
Disallow /index/
Disallow /storage/*.webm
Disallow /storage2/*.webm
Disallow /storage3/*.webm

bingbot

Rule Path
Disallow /storage/*.webm
Disallow /storage2/*.webm
Disallow /storage3/*.webm

Other Records

Field Value
crawl-delay 1

mj12bot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

semrushbot-ba

Rule Path
Disallow /

sentibot

Rule Path
Disallow /

cliqzbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

Other Records

Field Value
sitemap https://arhivach.top/sitemap.xml

Warnings

  • 11 invalid lines.