bolf.lt
robots.txt

Robots Exclusion Standard data for bolf.lt

Resource Scan

Scan Details

Site Domain bolf.lt
Base Domain bolf.lt
Scan Status Ok
Last Scan2025-11-14T17:38:34+00:00
Next Scan 2025-12-14T17:38:34+00:00

Last Scan

Scanned2025-11-14T17:38:34+00:00
URL https://bolf.lt/robots.txt
Domain IPs 5.149.163.145
Response IP 5.149.163.145
Found Yes
Hash f1f09a4b79639297131f453618713713a1f2d20805bc54f49dbd01298c945e8d
SimHash 1a1cd6508833

Groups

*

Rule Path
Disallow /*?rec=*
Disallow /*%26rec%3D*

ahrefsbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

metajobbot

Rule Path
Disallow /

exabot

Rule Path
Disallow /

ezooms

Rule Path
Disallow /

fyberspider

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

mojeekbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

wotbox

Rule Path
Disallow /

twengabot-discover

Rule Path
Disallow /

twengabot/2.0

Rule Path
Disallow /

twengabot

Rule Path
Disallow /

twengabot-2.0

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

yandex

Rule Path
Disallow /

*

Rule Path
Disallow /*?filter_traits
Disallow /*?selected_size

Other Records

Field Value
sitemap https://bolf.lt/sitemap.xml.gz

Comments

  • Pages with rec parameter - IAI Recommendation System
  • Automatically banned scanners and crawlers section
  • Section end

Warnings

  • 3 invalid lines.