lordfilm.bot
robots.txt

Robots Exclusion Standard data for lordfilm.bot

Resource Scan

Scan Details

Site Domain lordfilm.bot
Base Domain lordfilm.bot
Scan Status Ok
Last Scan2024-05-07T00:11:46+00:00
Next Scan 2024-06-06T00:11:46+00:00

Last Scan

Scanned2024-05-07T00:11:46+00:00
URL https://www.lordfilm.bot/robots.txt
Domain IPs 104.26.14.111, 104.26.15.111, 172.67.68.27, 2606:4700:20::681a:e6f, 2606:4700:20::681a:f6f, 2606:4700:20::ac43:441b
Response IP 104.26.14.111
Found Yes
Hash 1b31e67b9a0388a66506731b1d5d0538b0d3cd21bc33f8f90bf06ffe2564261c
SimHash 59192553c711

Groups

yandex

Rule Path
Disallow /

*

Rule Path
Allow /engine/classes/min/index.php?
Allow /templates/lordfilm/fonts/*
Allow /engine/classes/js/lazyload.js
Allow /engine/classes/js/jqueryui3.js
Allow /engine/classes/js/jquery3.js
Allow /engine/classes/js/dle_js.js
Disallow /cdn-cgi/
Disallow /engine/
Disallow /2021/
Disallow /2022/
Disallow /2023/
Disallow /2020/
Disallow /language/
Disallow /newposts/
Disallow /lastnews/
Disallow /*subaction%3Duserinfo
Disallow /*subaction%3Dnewposts
Disallow /*do%3Dlastcomments
Disallow /*do%3Dfeedback
Disallow /*do%3Dregister
Disallow /*do%3Drules
Disallow /*do%3Dlostpassword
Disallow /*do%3Daddnews
Disallow /*do%3Dstats
Disallow /*do%3Dpm
Disallow /*do%3Dsearch
Disallow /*?
Disallow */f/*

blexbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

criteobot/0.1

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

proximic

Rule Path
Disallow /

grapeshot

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

Other Records

Field Value
sitemap https://lordfilm.bot/sitemap.xml

Warnings

  • `host` is not a known field.