c4i-cyber.com
robots.txt

Robots Exclusion Standard data for c4i-cyber.com

Resource Scan

Scan Details

Site Domain c4i-cyber.com
Base Domain c4i-cyber.com
Scan Status Ok
Last Scan2025-09-25T18:02:35+00:00
Next Scan 2025-10-02T18:02:35+00:00

Last Scan

Scanned2025-09-25T18:02:35+00:00
URL https://c4i-cyber.com/robots.txt
Domain IPs 104.21.58.40, 172.67.155.247, 2606:4700:3036::ac43:9bf7, 2606:4700:3037::6815:3a28
Response IP 104.21.58.40
Found Yes
Hash 6d9f5350e35741d8e752bc442ab6438e0b43e5777a4e98a912b066f73b946769
SimHash 443588d3c5d5

Groups

*

Rule Path
Allow /

amazonbot

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

mediapartners-google

Rule Path
Disallow

googlebot

Rule Path
Disallow

msnbot

Rule Path
Disallow

slurp

Rule Path
Disallow

teoma

Rule Path
Disallow

twiceler

Rule Path
Disallow

gigabot

Rule Path
Disallow

scrubby

Rule Path
Disallow

robozilla

Rule Path
Disallow

ia_archiver

Rule Path
Disallow

*

Rule Path
Disallow /

Comments

  • As a condition of accessing this website, you agree to abide by the
  • following content-signals:
  • (a) If a content-signal = yes, you may collect content for the
  • corresponding use.
  • (b) If a content-signal = no, you may not collect content for the
  • corresponding use.
  • (c) If the website operator does not include a content signal for a
  • corresponding use, the website operator neither grants nor restricts
  • permission via content signal with respect to the corresponding use.
  • The content signals and their meanings are:
  • search: building a search index and providing search results (e.g., returning
  • hyperlinks and short excerpts from your website's contents). Search
  • does not include providing AI-generated search summaries.
  • ai-input: inputting content into one or more AI models (e.g., retrieval
  • augmented generation, grounding, or other real-time taking of
  • content for generative AI search answers).
  • ai-train: training or fine-tuning AI models.
  • ANY RESTRICTIONS EXPRESSED VIA CONTENT-SIGNALS ARE EXPRESS RESERVATIONS OF
  • RIGHTS UNDER ARTICLE 4 OF THE EUROPEAN UNION DIRECTIVE 2019/790 ON COPYRIGHT
  • AND RELATED RIGHTS IN THE DIGITAL SINGLE MARKET.
  • BEGIN Cloudflare Managed content
  • END Cloudflare Managed Content
  • Allows only major search engines and known friendly spiders
  • Major Search Engines and Known Friendly Spiders (allowed)
  • Google adsense crawler analyzes for ad serving
  • Google crawler indexes in search database
  • Everyone Else (NOT allowed)

Warnings

  • `content-signal` is not a known field.