leilaoninja.com
robots.txt

Robots Exclusion Standard data for leilaoninja.com

Resource Scan

Scan Details

Site Domain leilaoninja.com
Base Domain leilaoninja.com
Scan Status Ok
Last Scan2025-11-16T08:52:57+00:00
Next Scan 2025-12-16T08:52:57+00:00

Last Scan

Scanned2025-11-16T08:52:57+00:00
URL https://leilaoninja.com/robots.txt
Domain IPs 104.21.9.185, 172.67.143.104, 2606:4700:3035::6815:9b9, 2606:4700:3035::ac43:8f68
Response IP 172.67.143.104
Found Yes
Hash 8e698148765ae4fe3de4bf5dd6ace8bb9d968340ac86f3c727fa90c6e8c25192
SimHash 44754f52cd54

Groups

*

Rule Path
Allow /

amazonbot

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /search
Disallow /imovel/

amazonbot

Rule Path
Disallow /search
Disallow /imovel/

semrushbot

Rule Path
Disallow /search
Disallow /imovel/

dotbot

Rule Path
Disallow /search
Disallow /imovel/

claudebot

Rule Path
Disallow /search
Disallow /imovel/

facebookexternalhit

Rule Path
Disallow /search
Disallow /imovel/

facebookcatalog

Rule Path
Disallow /search
Disallow /imovel/

meta-externalagent

Rule Path
Disallow /search
Disallow /imovel/

*

Rule Path
Allow /
Disallow /admin/
Disallow /api/
Disallow /temp/
Disallow /storage/

Comments

  • As a condition of accessing this website, you agree to abide by the following
  • content signals:
  • (a) If a content-signal = yes, you may collect content for the corresponding
  • use.
  • (b) If a content-signal = no, you may not collect content for the
  • corresponding use.
  • (c) If the website operator does not include a content signal for a
  • corresponding use, the website operator neither grants nor restricts
  • permission via content signal with respect to the corresponding use.
  • The content signals and their meanings are:
  • search: building a search index and providing search results (e.g., returning
  • hyperlinks and short excerpts from your website's contents). Search does not
  • include providing AI-generated search summaries.
  • ai-input: inputting content into one or more AI models (e.g., retrieval
  • augmented generation, grounding, or other real-time taking of content for
  • generative AI search answers).
  • ai-train: training or fine-tuning AI models.
  • ANY RESTRICTIONS EXPRESSED VIA CONTENT SIGNALS ARE EXPRESS RESERVATIONS OF
  • RIGHTS UNDER ARTICLE 4 OF THE EUROPEAN UNION DIRECTIVE 2019/790 ON COPYRIGHT
  • AND RELATED RIGHTS IN THE DIGITAL SINGLE MARKET.
  • BEGIN Cloudflare Managed content
  • END Cloudflare Managed Content
  • Robots.txt para Leilão Ninja
  • Bloqueia bots específicos de crawling das rotas sensíveis
  • Babbar.tech Barkrowler
  • Amazon Amazonbot
  • Semrush Bot
  • Moz DotBot
  • Anthropic Claude Bot
  • Facebook External Hit
  • Facebook Catalog
  • Meta External Agent
  • Permitir outros bots em geral, mas bloquear rotas específicas
  • Sitemap (opcional - adicione se você tiver)
  • Sitemap: https://www.leilaoninja.com/sitemap.xml

Warnings

  • `content-signal` is not a known field.