Robots Exclusion Standard data for balatore.com.br

Resource Scan

Scan Details

Site Domain balatore.com.br
Base Domain balatore.com.br
Scan Status Ok
Last Scan 2024-06-08T14:58:58+00:00
Next Scan 2024-07-08T14:58:58+00:00

Last Scan

Scanned 2024-06-08T14:58:58+00:00
URL https://balatore.com.br/robots.txt
Redirect https://www.balatore.com.br/robots.txt
Redirect Domain www.balatore.com.br
Redirect Base balatore.com.br
Domain IPs 104.21.24.34, 172.67.216.188, 2606:4700:3030::6815:1822, 2606:4700:3035::ac43:d8bc
Redirect IPs 104.21.24.34, 172.67.216.188, 2606:4700:3030::6815:1822, 2606:4700:3035::ac43:d8bc
Response IP 104.21.24.34
Found Yes
Hash c53fb713ff78fff2cb31e18ff84c78c36ad896306ced5d519b25712501ea9270
SimHash 3670c7a8ce11

Groups

*

Rule Path
Disallow /cdn-cgi/
Disallow /busca*
Disallow /*%26sort%3D*
Disallow /*%26order%3D*
Disallow /*%26limit%3D*
Disallow /*%26q%3D*
Disallow /*?q=*
Disallow /*%26filter%3D*
Disallow /*?filter=*
Disallow /*%26size%3D*
Disallow /*?size=*
Disallow /login$
Disallow /carrinho$
Disallow /cadastro$
Disallow /checkout$
Disallow /vale-presente
Disallow /meus-pedidos
Disallow /minha-conta
Disallow /minha-conta-afiliado
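
The rules in this group rely on two pattern features: `*` as a wildcard and a trailing `$` as an end-of-URL anchor. The following is a minimal Python sketch of how such patterns might be evaluated against a request path. It is not the scanner's or any crawler's actual implementation. The helper names `pattern_to_regex` and `is_disallowed` are hypothetical, only a subset of the listed rules is included, paths are compared as raw strings with no percent-encoding normalization, and Allow rules and longest-match precedence are ignored because this group contains only Disallow rules.

```python
import re

# Subset of the Disallow patterns listed for the "*" group above.
DISALLOW = [
    "/cdn-cgi/",
    "/busca*",
    "/*?q=*",
    "/*%26sort%3D*",
    "/login$",
    "/minha-conta",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into an anchored regex.

    '*' matches any run of characters, and a trailing '$' pins the end
    of the URL. Without '$', the pattern is treated as a prefix match.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_disallowed(path: str, patterns=DISALLOW) -> bool:
    """Return True if any Disallow pattern matches the given path."""
    return any(pattern_to_regex(p).match(path) for p in patterns)


if __name__ == "__main__":
    for path in ["/login", "/login/reset", "/busca?termo=x",
                 "/produto?q=camisa", "/minha-conta-afiliado",
                 "/produtos/camisa"]:
        print(path, is_disallowed(path))
```

Under these assumptions, `/login$` blocks only the exact path `/login`, while the unanchored `/minha-conta` also covers `/minha-conta-afiliado` as a prefix, which makes the separate rule for the latter redundant in this sketch.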

domaincrawler/3.0

Rule Path
Disallow /

dirbuster-0.12

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

baiduspider-video

Rule Path
Disallow /

baiduspider-image

Rule Path
Disallow /

baiduspider+

Rule Path
Disallow /

twengabot-discover

Rule Path
Disallow /

twengabot

Rule Path
Disallow /

twengabot-2.0

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

bdcbot

Rule Path
Disallow /

spbot

Rule Path
Disallow /

linkpadbot

Rule Path
Disallow /

wbsearchbot

Rule Path
Disallow /

addthis.com

Rule Path
Disallow /

exabot

Rule Path
Disallow /

yandeximages

Rule Path
Disallow /

yandex

Rule Path
Disallow /

slurp

Rule Path
Disallow /

spbot

Rule Path
Disallow /

everyonesocialbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://files.irroba.com.br/balaaiao/feeds/sitemap.xml