do.agentiz.com
robots.txt

Robots Exclusion Standard data for do.agentiz.com

Resource Scan

Scan Details

Site Domain do.agentiz.com
Base Domain agentiz.com
Scan Status Ok
Last Scan 2025-12-26T14:42:00+00:00
Next Scan 2026-01-02T14:42:00+00:00

Last Scan

Scanned 2025-12-26T14:42:00+00:00
URL https://do.agentiz.com/robots.txt
Domain IPs 104.21.23.178, 172.67.212.117, 2606:4700:3034::ac43:d475, 2606:4700:3036::6815:17b2
Response IP 104.21.23.178
Found Yes
Hash 1d9270a1c8ec4183c6f4b0642d75a84fb56c8bdf7b415e6b2a16d8eaefc9fc1b
SimHash 483151f549fd

Groups

googlebot
googlebot-image
googleother
bingbot
bingpreview
msnbot
applebot
duckduckbot
ecosiabot
pinterestbot
seznambot
qwantify
exabot
orangebot
slurp
yeti
daumoa
coccocbot
yaanibot
facebookexternalhit
facebookcatalog
instagram
linkedinbot
twitterbot
meta-externalfetcher
telegrambot
viber
whatsapp
cloudflareobservatory
feedfetcher-google
google-imageproxy
google-read-aloud
google-site-verification
mediapartners-google
adsbot-google
adsbot-msn
uptimerobot
chatgpt-user
oai-searchbot
claude-web
perplexitybot
youbot
phindbot
kagibot
grokbot
xai-grok

Rule Path
Allow /
Disallow /*-xs.jpg
Disallow /*-xs.webp
Disallow /*-xxs.jpg
Disallow /*-xxs.webp
Disallow /*-xxxs.jpg
Disallow /*-xxxs.webp
Disallow /*/?type=*
Disallow /*/?search=*
Disallow /assets
Disallow /*/assets
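
The rules for this group combine a blanket `Allow /` with wildcard `Disallow` patterns, so the outcome for a given URL depends on which pattern wins. The following is a minimal sketch of how a crawler might evaluate them, assuming the matching semantics of RFC 9309 (`*` matches any run of characters, a trailing `$` anchors the end of the path, the longest matching pattern wins, and `Allow` wins ties). The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical and are not part of the scan report or of any library.

```python
import re

# Rules copied from the group above, in the order the report lists them.
RULES = [
    ("allow", "/"),
    ("disallow", "/*-xs.jpg"),
    ("disallow", "/*-xs.webp"),
    ("disallow", "/*-xxs.jpg"),
    ("disallow", "/*-xxs.webp"),
    ("disallow", "/*-xxxs.jpg"),
    ("disallow", "/*-xxxs.webp"),
    ("disallow", "/*/?type=*"),
    ("disallow", "/*/?search=*"),
    ("disallow", "/assets"),
    ("disallow", "/*/assets"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with ".*" for each "*" wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(rules, path):
    """Return True if `path` may be crawled under `rules`.

    Assumed precedence: longest matching pattern wins, Allow wins ties,
    and a path matched by no rule is allowed.
    """
    best = None
    for kind, pattern in rules:
        # re.match anchors at the start, mirroring prefix matching.
        if pattern_to_regex(pattern).match(path):
            key = (len(pattern), kind == "allow")
            if best is None or key > best:
                best = key
    return True if best is None else best[1]


if __name__ == "__main__":
    for path in ("/en/listings", "/photos/house-xs.jpg", "/en/?type=rent"):
        print(path, is_allowed(RULES, path))
```

Under these assumptions, `/en/listings` is matched only by `Allow /` and stays crawlable, while thumbnail variants, `?type=` and `?search=` listing URLs, and anything under an `assets` directory are blocked because the more specific `Disallow` pattern is longer than `/`.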

bytespider
ahrefsbot
ahrefssiteaudit
mj12bot
semrushbot
gptbot
ccbot
google-extended

Rule Path
Disallow /

*

Rule Path
Allow /$
Disallow /
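
Taken together, the three groups mean that a crawler's treatment depends on which group its user-agent token falls into. For any agent that lands in the `*` group, `Allow /$` matches only the root URL (the `$` anchors the end of the path), so everything else falls under `Disallow /`. The sketch below illustrates the group lookup only, assuming a simplified case-insensitive exact match on the product token with `*` as the fallback. The group labels `listed_allowed` and `listed_blocked` and the function `select_group` are hypothetical, and the agent sets are abbreviated from the lists in this report.

```python
# Abbreviated agent sets; the report above lists the full groups.
GROUPS = {
    # Group with "Allow /" plus the wildcard Disallow patterns.
    "listed_allowed": {"googlebot", "bingbot", "chatgpt-user", "perplexitybot"},
    # Group with a single "Disallow /".
    "listed_blocked": {"bytespider", "ahrefsbot", "gptbot", "ccbot", "google-extended"},
}


def select_group(product_token):
    """Return the label of the group that applies to a crawler.

    Simplifying assumption: exact, case-insensitive match on the product
    token; agents not named in any group fall back to "*".
    """
    token = product_token.strip().lower()
    for label, agents in GROUPS.items():
        if token in agents:
            return label
    return "*"


if __name__ == "__main__":
    for agent in ("Googlebot", "GPTBot", "ExampleCrawler"):
        print(agent, "->", select_group(agent))
```

A real parser would read the groups from the fetched file rather than hard-coding them, and may apply more elaborate token matching than this sketch does.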

Other Records

Field Value
sitemap https://do.agentiz.com/sitemap.xml

Comments

  • DO do.agentiz.com
  • As a condition of accessing this website, automated agents (including crawlers, bots, scrapers, AI systems, and aggregators) agree to the following:
  • 1) You may crawl this website only to the extent explicitly permitted by the rules below (User-agent / Allow / Disallow).
  • 2) You may use our content for:
      - search indexing and displaying search results (including links, titles, and short snippets), where allowed by robots.txt;
      - AI input / retrieval (for answering user queries in real time), where allowed by robots.txt and applicable content signals.
  • 3) You may NOT:
      - use our content for training or fine-tuning AI models;
      - perform bulk scraping or large-scale copying for the purpose of building commercial databases, aggregators, or mirrored listing portals;
      - republish, resell, or systematically reuse our content without our prior written consent.
  • Any restrictions expressed in this robots.txt file (including content signals) constitute an express reservation of rights, including under Article 4 of Directive (EU) 2019/790 and any similar laws in other jurisdictions.
  • Additional permissions for specific bots or services are granted only where they are explicitly listed as allowed in the rules below.
  • For questions, partnership requests, or additional permissions, please contact us at: https://www.agentiz.com/en/support/feedback

Warnings

  • `content-signal` is not a known field.