comercia.io
robots.txt

Robots Exclusion Standard data for comercia.io

Resource Scan

Scan Details

Site Domain comercia.io
Base Domain comercia.io
Scan Status Ok
Last Scan 2026-02-08T18:42:54+00:00
Next Scan 2026-03-10T18:42:54+00:00

Last Scan

Scanned 2026-02-08T18:42:54+00:00
URL https://comercia.io/robots.txt
Domain IPs 104.26.10.74, 104.26.11.74, 172.67.74.158, 2606:4700:20::681a:a4a, 2606:4700:20::681a:b4a, 2606:4700:20::ac43:4a9e
Response IP 172.67.74.158
Found Yes
Hash aa191ee9dddefd78ab65dbe411be8068f527d224afb0950d3a73991e94340dd9
SimHash 46314953cd55

Groups

*

Rule Path
Allow /

amazonbot

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

claudebot
anthropic-ai
claude-web
siteauditbot
semrushbot-ba
semrushbot-si
semrushbot-swa
semrushbot-ct
plitsignalbot
semrushbot-coub
mozilla/5.0 applewebkit/537.36 (khtml, like gecko; compatible; geedoproductsearch; +http://www.geedo.com/product-search.html) chrome/79.0.3945.88 safari/537.36

Rule Path
Disallow /

*

Rule Path
Disallow /cart
Disallow /payment
Disallow /addresses
Disallow /shipping
Disallow /payment
Disallow /checkout
Disallow /share
Disallow /summary
Disallow /clientes

adsbot-google

Rule Path
Disallow /cart
Disallow /payment
Disallow /addresses
Disallow /shipping
Disallow /payment
Disallow /checkout
Disallow /share
Disallow /summary
Disallow /clientes
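
The groups above can be read with the matching rules of RFC 9309: groups naming the same user agent are combined, the longest matching path wins, and Allow wins a tie. The following is a minimal sketch of that evaluation, not the scanner's own logic. The `GROUPS` table is a hand-transcribed subset of the groups listed above, and the names `rules_for` and `is_allowed` are hypothetical helpers introduced for illustration. It assumes plain prefix paths (no `*` or `$` wildcards) and exact, case-insensitive user-agent tokens.

```python
from typing import List, Tuple

Rule = Tuple[str, str]  # (kind, path prefix), kind is "allow" or "disallow"

# Subset of the scanned groups, transcribed by hand (assumption: abbreviated).
GROUPS: List[Tuple[List[str], List[Rule]]] = [
    (["*"], [("allow", "/")]),
    (["gptbot"], [("disallow", "/")]),
    (["*"], [("disallow", "/cart"), ("disallow", "/checkout")]),
    (["adsbot-google"], [("disallow", "/cart"), ("disallow", "/checkout")]),
]


def rules_for(agent: str) -> List[Rule]:
    """Collect rules for an agent, merging repeated groups.

    Groups that name the agent explicitly take precedence; the
    wildcard groups are used only when no explicit group exists.
    """
    agent = agent.lower()
    specific: List[Rule] = []
    wildcard: List[Rule] = []
    for agents, rules in GROUPS:
        if agent in agents:
            specific.extend(rules)
        elif "*" in agents:
            wildcard.extend(rules)
    return specific or wildcard


def is_allowed(agent: str, path: str) -> bool:
    """Longest matching prefix wins; Allow wins a tie; no match means allowed."""
    best_len = -1
    allowed = True
    for kind, prefix in rules_for(agent):
        if not path.startswith(prefix):
            continue
        longer = len(prefix) > best_len
        tie_for_allow = len(prefix) == best_len and kind == "allow"
        if longer or tie_for_allow:
            best_len = len(prefix)
            allowed = kind == "allow"
    return allowed


if __name__ == "__main__":
    for agent, path in [
        ("gptbot", "/"),
        ("somebot", "/cart"),
        ("somebot", "/products"),
        ("adsbot-google", "/cart"),
    ]:
        print(agent, path, is_allowed(agent, path))
```

Under these assumptions, a crawler without its own group is still kept out of `/cart`, because the two `*` groups are merged and `Disallow /cart` is a longer match than `Allow /`. Parsers that honor only the first `*` group would reach a different result.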

Other Records

Field Value
sitemap https://comercia.io/sitemap
sitemap https://comercia.io/sitemap

Comments

  • As a condition of accessing this website, you agree to abide by the following content signals:
  • (a) If a Content-Signal = yes, you may collect content for the corresponding use.
  • (b) If a Content-Signal = no, you may not collect content for the corresponding use.
  • (c) If the website operator does not include a Content-Signal for a corresponding use, the website operator neither grants nor restricts permission via Content-Signal with respect to the corresponding use.
  • The content signals and their meanings are:
  • search: building a search index and providing search results (e.g., returning hyperlinks and short excerpts from your website's contents). Search does not include providing AI-generated search summaries.
  • ai-input: inputting content into one or more AI models (e.g., retrieval augmented generation, grounding, or other real-time taking of content for generative AI search answers).
  • ai-train: training or fine-tuning AI models.
  • ANY RESTRICTIONS EXPRESSED VIA CONTENT SIGNALS ARE EXPRESS RESERVATIONS OF RIGHTS UNDER ARTICLE 4 OF THE EUROPEAN UNION DIRECTIVE 2019/790 ON COPYRIGHT AND RELATED RIGHTS IN THE DIGITAL SINGLE MARKET.
  • BEGIN Cloudflare Managed content
  • END Cloudflare Managed Content
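
The three cases (a) to (c) in the comments above amount to a three-valued lookup: yes, no, or not stated. The sketch below illustrates that reading. The scan does not show the actual Content-Signal line, so the sample value `search=yes, ai-train=no` is hypothetical, the comma-separated `name=yes|no` syntax is an assumption, and `parse_content_signal` and `permission` are illustrative names rather than part of any library.

```python
from typing import Dict, Optional

# Signal names taken from the comment block above.
KNOWN_SIGNALS = ("search", "ai-input", "ai-train")


def parse_content_signal(value: str) -> Dict[str, bool]:
    """Parse a value such as 'search=yes, ai-train=no' (assumed syntax).

    Unknown names and values other than yes/no are ignored.
    """
    signals: Dict[str, bool] = {}
    for part in value.split(","):
        name, sep, flag = part.strip().partition("=")
        name = name.strip().lower()
        flag = flag.strip().lower()
        if sep and name in KNOWN_SIGNALS and flag in ("yes", "no"):
            signals[name] = flag == "yes"
    return signals


def permission(signals: Dict[str, bool], use: str) -> Optional[bool]:
    """True: case (a), collection permitted. False: case (b), not permitted.

    None: case (c), the signal neither grants nor restricts permission.
    """
    return signals.get(use)


if __name__ == "__main__":
    # Hypothetical value; the real line is not included in the scan.
    sample = parse_content_signal("search=yes, ai-train=no")
    for use in KNOWN_SIGNALS:
        print(use, permission(sample, use))
```

Returning `None` for a missing signal keeps case (c) distinct from an explicit "no", which matters because the comment text says an absent signal does not restrict the use.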

Warnings

  • `content-signal` is not a known field.