arhivach.xyz
robots.txt

Robots Exclusion Standard data for arhivach.xyz

Resource Scan

Scan Details

Site Domain arhivach.xyz
Base Domain arhivach.xyz
Scan Status Ok
Last Scan2024-05-31T09:32:31+00:00
Next Scan 2024-06-07T09:32:31+00:00

Last Scan

Scanned2024-05-31T09:32:31+00:00
URL https://arhivach.xyz/robots.txt
Redirect http://arhivach.top/robots.txt
Redirect Domain arhivach.top
Redirect Base arhivach.top
Domain IPs 104.21.21.4, 172.67.195.68, 2606:4700:3032::6815:1504, 2606:4700:3032::ac43:c344
Redirect IPs 104.21.22.60, 172.67.202.252, 2606:4700:3031::6815:163c, 2606:4700:3033::ac43:cafc
Response IP 172.67.202.252
Found Yes
Hash 8395b246ff292da0db1fff2245c02c9fa1249aa50836d81c60ce75f8a01352e6
SimHash 6b8e7845d758

Groups

yandexbot

Rule Path
Disallow /index/
Disallow /storage/*.webm
Disallow /storage2/*.webm
Disallow /storage3/*.webm

bingbot

Rule Path
Disallow /storage/*.webm
Disallow /storage2/*.webm
Disallow /storage3/*.webm

Other Records

Field Value
crawl-delay 1

mj12bot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

sentibot

Rule Path
Disallow /

cliqzbot

Rule Path
Disallow /

cliqzbot/2.0

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

Other Records

Field Value
sitemap http://arhivach.top/sitemap.xml

Warnings

  • 11 invalid lines.