filmoteka.cc
robots.txt

Robots Exclusion Standard data for filmoteka.cc

Resource Scan

Scan Details

Site Domain filmoteka.cc
Base Domain filmoteka.cc
Scan Status Ok
Last Scan2024-05-28T11:16:47+00:00
Next Scan 2024-06-27T11:16:47+00:00

Last Scan

Scanned2024-05-28T11:16:47+00:00
URL https://filmoteka.cc/robots.txt
Domain IPs 104.21.1.196, 172.67.129.223, 2606:4700:3030::ac43:81df, 2606:4700:3036::6815:1c4
Response IP 172.67.129.223
Found Yes
Hash cf3615c77c8f20aa896615ef13b6f5438c4f49efca85a054edf9935e2f21b516
SimHash d50d30797d31

Groups

*

Rule Path
Disallow /engine/go.php
Disallow /user/
Disallow /newposts/
Disallow /statistics.html
Disallow /*subaction%3Duserinfo
Disallow /*subaction%3Dnewposts
Disallow /*do%3Dlastcomments
Disallow /*do%3Dfeedback
Disallow /*do%3Dregister
Disallow /*do%3Dlostpassword
Disallow /*do%3Daddnews
Disallow /*do%3Dstats
Disallow /*do%3Dpm
Disallow /*do%3Dsearch
Disallow /*do%3Ddownload
Disallow /*do%3Dgo
Disallow /help/
Disallow */?filter=*
Disallow /year/*-*
Disallow /megasearch/
Disallow /continue/
Disallow /favorites/
Disallow /tags/*

blexbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

criteobot/0.1

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

proximic

Rule Path
Disallow /

grapeshot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://filmoteka.cc/sitemap.xml

Warnings

  • `host` is not a known field.