roliki.tv
robots.txt

Robots Exclusion Standard data for roliki.tv

Resource Scan

Scan Details

Site Domain roliki.tv
Base Domain roliki.tv
Scan Status Ok
Last Scan2024-09-19T22:55:25+00:00
Next Scan 2024-10-19T22:55:25+00:00

Last Scan

Scanned2024-09-19T22:55:25+00:00
URL https://roliki.tv/robots.txt
Domain IPs 104.21.88.84, 172.67.174.81, 2606:4700:3036::ac43:ae51, 2606:4700:3037::6815:5854
Response IP 172.67.174.81
Found Yes
Hash 7ce904e85f5319d04d5a0a51ceaba0c43fa7305339b57f5484f92457bdca2afb
SimHash 522cd1426b38

Groups

*

Rule Path
Disallow /*.php*
Disallow /_*
Disallow /*?*

yandex

Rule Path
Disallow /*.php*
Disallow /_*
Disallow /*?*
Disallow /porno/*
Disallow /search*
Disallow /*page*

baiduspider

Rule Path
Disallow /

yandexvideoparser

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

vagabondo

Rule Path
Disallow /

seokicks-robot

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

archive.org_bot

Rule Path
Disallow /

special_archiver

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

special_archiver

Rule Path
Disallow /

heritrix

Rule Path
Disallow /

netestate ne crawler

Rule Path
Disallow /

sistrix

Rule Path
Disallow /

wbsearchbot

Rule Path
Disallow /

queryseekerspider

Rule Path
Disallow /

proximic

Rule Path
Disallow /

siteexplorer

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

Warnings

  • 2 invalid lines.
  • `host` is not a known field.