techmalhas.com.br
robots.txt

Robots Exclusion Standard data for techmalhas.com.br

Resource Scan

Scan Details

Site Domain techmalhas.com.br
Base Domain techmalhas.com.br
Scan Status Ok
Last Scan2025-06-01T16:00:52+00:00
Next Scan 2025-07-01T16:00:52+00:00

Last Scan

Scanned2025-06-01T16:00:52+00:00
URL https://techmalhas.com.br/robots.txt
Domain IPs 104.21.85.4, 172.67.200.107, 2606:4700:3033::ac43:c86b, 2606:4700:3035::6815:5504
Response IP 104.21.85.4
Found Yes
Hash 840aac00ef729e64222176fe8628e8e6c891a7c50fc0ecc95c65fc6ad03394b2
SimHash 353046ad4a11

Groups

*

Rule Path
Disallow /cdn-cgi/
Disallow /busca*
Disallow /*%26sort%3D*
Disallow /*%26order%3D*
Disallow /*%26limit%3D*
Disallow /*%26q%3D*
Disallow /*?q=*
Disallow /*%26filter%3D*
Disallow /*?filter=*
Disallow /*%26size%3D*
Disallow /*?size=*
Disallow /login$
Disallow /carrinho$
Disallow /cadastro$
Disallow /checkout$
Disallow /vale-presente
Disallow /meus-pedidos
Disallow /minha-conta
Disallow /minha-conta-afiliado

dirbuster-0.12

Rule Path
Disallow /

domaincrawler/3.0

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

baiduspider-video

Rule Path
Disallow /

baiduspider-image

Rule Path
Disallow /

baiduspider+

Rule Path
Disallow /

twengabot-discover

Rule Path
Disallow /

twengabot

Rule Path
Disallow /

twengabot-2.0

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

bdcbot

Rule Path
Disallow /

spbot

Rule Path
Disallow /

linkpadbot

Rule Path
Disallow /

wbsearchbot

Rule Path
Disallow /

addthis.com

Rule Path
Disallow /

exabot

Rule Path
Disallow /

yandeximages

Rule Path
Disallow /

yandex

Rule Path
Disallow /

slurp

Rule Path
Disallow /

spbot

Rule Path
Disallow /

everyonesocialbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://techmalhas.com.br/sitemap_category.xml
sitemap https://techmalhas.com.br/sitemap_image.xml
sitemap https://techmalhas.com.br/sitemap_products.xml
sitemap https://techmalhas.com.br/sitemap_pages.xml