nutricaoemfoco.com.br
robots.txt

Robots Exclusion Standard data for nutricaoemfoco.com.br

Resource Scan

Scan Details

Site Domain nutricaoemfoco.com.br
Base Domain nutricaoemfoco.com.br
Scan Status Ok
Last Scan2026-03-29T08:29:14+00:00
Next Scan 2026-04-05T08:29:14+00:00

Last Scan

Scanned2026-03-29T08:29:14+00:00
URL https://nutricaoemfoco.com.br/robots.txt
Domain IPs 104.21.71.102, 172.67.144.98, 2606:4700:3032::ac43:9062, 2606:4700:3033::6815:4766
Response IP 104.21.71.102
Found Yes
Hash ff4b74b25b37a563801c9d536cd520782d53b07681d662beb0305375e97b87fc
SimHash 98f2ddc4261e

Groups

*

Rule Path
Allow /
Disallow /search/
Disallow /?s=
Disallow /search.php
Disallow *?s=
Disallow *%26s%3D
Disallow /search/
Disallow /search*

sogou

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

easouspider

Rule Path
Disallow /

yisouspider

Rule Path
Disallow /

haosouspider

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

zoominfobot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

aspiegelbot

Rule Path
Disallow /

zitebot

Rule Path
Disallow /

zmeu

Rule Path
Disallow /

zoominfobot

Rule Path
Disallow /

zumbot

Rule Path
Disallow /

zyborg

Rule Path
Disallow /

abonti

Rule Path
Disallow /

Other Records

Field Value
sitemap https://nutricaoemfoco.com.br/sitemap_index.xml

Comments

  • Configuração geral para todos os bots permitidos
  • Motores de busca chineses
  • Bots conhecidos por spam/scraping
  • Crawlers agressivos conhecidos
  • Bots maliciosos

Warnings

  • 8 invalid lines.