gregoriopatinetes.com.br
robots.txt

Robots Exclusion Standard data for gregoriopatinetes.com.br

Resource Scan

Scan Details

Site Domain gregoriopatinetes.com.br
Base Domain gregoriopatinetes.com.br
Scan Status Ok
Last Scan2024-09-16T23:51:59+00:00
Next Scan 2024-10-16T23:51:59+00:00

Last Scan

Scanned2024-09-16T23:51:59+00:00
URL https://gregoriopatinetes.com.br/robots.txt
Domain IPs 104.21.95.183, 172.67.146.246, 2606:4700:3031::ac43:92f6, 2606:4700:3034::6815:5fb7
Response IP 172.67.146.246
Found Yes
Hash d238a6d00fe47b7b70ea414e9b55184c64e6fc9b51078024c8272f35637b2164
SimHash 3530c7acce11

Groups

*

Rule Path
Disallow /cdn-cgi/
Disallow /busca*
Disallow /*%26sort%3D*
Disallow /*%26order%3D*
Disallow /*%26limit%3D*
Disallow /*%26q%3D*
Disallow /*?q=*
Disallow /*%26filter%3D*
Disallow /*?filter=*
Disallow /*%26size%3D*
Disallow /*?size=*
Disallow /login$
Disallow /carrinho$
Disallow /cadastro$
Disallow /checkout$
Disallow /vale-presente
Disallow /meus-pedidos
Disallow /minha-conta
Disallow /minha-conta-afiliado

domaincrawler/3.0

Rule Path
Disallow /

dirbuster-0.12

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

baiduspider-video

Rule Path
Disallow /

baiduspider-image

Rule Path
Disallow /

baiduspider+

Rule Path
Disallow /

twengabot-discover

Rule Path
Disallow /

twengabot

Rule Path
Disallow /

twengabot-2.0

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

bdcbot

Rule Path
Disallow /

spbot

Rule Path
Disallow /

linkpadbot

Rule Path
Disallow /

wbsearchbot

Rule Path
Disallow /

addthis.com

Rule Path
Disallow /

exabot

Rule Path
Disallow /

yandeximages

Rule Path
Disallow /

yandex

Rule Path
Disallow /

slurp

Rule Path
Disallow /

spbot

Rule Path
Disallow /

everyonesocialbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://files.irroba.com.br/gregorio/feeds/sitemap.xml