leao1918.com.br
robots.txt

Robots Exclusion Standard data for leao1918.com.br

Resource Scan

Scan Details

Site Domain leao1918.com.br
Base Domain leao1918.com.br
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-09-17T04:05:50+00:00
Next Scan 2024-12-16T04:05:50+00:00

Last Successful Scan

Scanned2024-05-28T16:30:46+00:00
URL https://leao1918.com.br/robots.txt
Redirect https://www.leao1918.com.br/robots.txt
Redirect Domain www.leao1918.com.br
Redirect Base leao1918.com.br
Domain IPs 18.164.154.100, 18.164.154.7, 18.164.154.78, 18.164.154.84
Redirect IPs 18.173.121.126, 18.173.121.37, 18.173.121.42, 18.173.121.67
Response IP 108.157.60.19
Found Yes
Hash fa3d76fff46a7832c84eacced8dd0c952fcd503914adec176cf37addac5cf2b4
SimHash 3c70c6a4ca01

Groups

*

Rule Path
Disallow /cdn-cgi/
Disallow /busca*
Disallow /*%26sort%3D*
Disallow /*%26order%3D*
Disallow /*%26limit%3D*
Disallow /*%26q%3D*
Disallow /*?q=*
Disallow /*%26filter%3D*
Disallow /*?filter=*
Disallow /*%26size%3D*
Disallow /*?size=*
Disallow /login$
Disallow /carrinho$
Disallow /cadastro$
Disallow /checkout$
Disallow /vale-presente
Disallow /meus-pedidos
Disallow /minha-conta
Disallow /minha-conta-afiliado

domaincrawler/3.0

Rule Path
Disallow /

dirbuster-0.12

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

baiduspider-video

Rule Path
Disallow /

baiduspider-image

Rule Path
Disallow /

baiduspider+

Rule Path
Disallow /

twengabot-discover

Rule Path
Disallow /

twengabot

Rule Path
Disallow /

twengabot-2.0

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

bdcbot

Rule Path
Disallow /

spbot

Rule Path
Disallow /

linkpadbot

Rule Path
Disallow /

wbsearchbot

Rule Path
Disallow /

addthis.com

Rule Path
Disallow /

exabot

Rule Path
Disallow /

yandeximages

Rule Path
Disallow /

yandex

Rule Path
Disallow /

slurp

Rule Path
Disallow /

spbot

Rule Path
Disallow /

everyonesocialbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://files.irroba.com.br/fortalez/feeds/sitemap.xml