whiskailabs.com
robots.txt

Robots Exclusion Standard data for whiskailabs.com

Resource Scan

Scan Details

Site Domain whiskailabs.com
Base Domain whiskailabs.com
Scan Status Ok
Last Scan2025-10-21T18:36:22+00:00
Next Scan 2025-10-28T18:36:22+00:00

Last Scan

Scanned2025-10-21T18:36:22+00:00
URL https://whiskailabs.com/robots.txt
Domain IPs 104.21.51.237, 172.67.191.177, 2606:4700:3033::6815:33ed, 2606:4700:3036::ac43:bfb1
Response IP 104.21.51.237
Found Yes
Hash e0ce1404c470bcdfaaed766b7a122cbf912258070d845ac4b23fc06b913f0cf7
SimHash 62b3ea93c5f4

Groups

*

Rule Path
Allow /

amazonbot

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

*

Rule Path
Allow /
Allow /$
Allow /es/$
Allow /fr/$
Allow /ar/$
Allow /de/$
Allow /ru/$
Allow /hi/$
Allow /it/$
Allow /ja/$
Allow /tr/$
Allow /ko/$
Allow /vi/$
Allow /pl/$
Allow /uk/$
Allow /nl/$
Allow /el/$
Allow /cs/$
Allow /sv/$
Allow /da/$
Allow /fi/$
Allow /no/$
Allow /hu/$
Allow /th/$
Allow /ms/$
Allow /id/$
Allow /fil/$
Allow /sw/$
Allow /fa/$
Allow /ro/$
Allow /he/$
Allow /bg/$
Allow /sk/$
Allow /hr/$
Allow /sr/$
Allow /lt/$
Allow /lv/$
Allow /et/$
Allow /sl/$
Allow /sq/$
Allow /ka/$
Allow /hy/$
Allow /kk/$
Allow /uz/$
Allow /az/$
Allow /ps/$
Allow /ku/$
Allow /ta/$
Allow /te/$
Allow /kn/$
Allow /ml/$
Allow /si/$
Allow /mr/$
Allow /gu/$
Allow /pa/$
Allow /or/$
Allow /as/$
Allow /ne/$
Allow /si/$
Allow /km/$
Allow /lo/$
Allow /mn/$
Allow /my/$
Allow /bo/$
Allow /so/$
Allow /am/$
Allow /om/$
Allow /ha/$
Allow /ig/$
Allow /xh/$
Allow /af/$
Allow /cy/$
Allow /ga/$
Allow /gd/$
Allow /eu/$
Allow /ca/$
Allow /gl/$
Allow /lb/$
Allow /mt/$
Allow /is/$
Allow /fo/$
Allow /se/$
Allow /sm/$
Allow /ay/$
Allow /gn/$
Allow /eo/$
Allow /mk/$
Allow /bs/$
Allow /youtube
Allow /twitter
Allow /instagram
Allow /facebook
Allow /tiktok
Allow /reddit
Allow /twitch
Allow /kick
Allow /vk
Allow /linkedin
Allow /pinterest
Allow /snapchat
Allow /whatsapp
Allow /vimeo
Allow /dailymotion
Allow /imgur
Allow /tumblr
Allow /medium
Allow /spotify
Allow /soundcloud
Allow /telegram

googlebot

Rule Path
Allow /*.js$
Allow /*.css$

bingbot

Rule Path
Allow /*.js$
Allow /*.css$

yandex

Rule Path
Allow /*.js$
Allow /*.css$

Other Records

Field Value
sitemap https://www.whiskailabs.com/sitemap.xml

Comments

  • As a condition of accessing this website, you agree to abide by the following
  • content signals:
  • (a) If a content-signal = yes, you may collect content for the corresponding
  • use.
  • (b) If a content-signal = no, you may not collect content for the
  • corresponding use.
  • (c) If the website operator does not include a content signal for a
  • corresponding use, the website operator neither grants nor restricts
  • permission via content signal with respect to the corresponding use.
  • The content signals and their meanings are:
  • search: building a search index and providing search results (e.g., returning
  • hyperlinks and short excerpts from your website's contents). Search does not
  • include providing AI-generated search summaries.
  • ai-input: inputting content into one or more AI models (e.g., retrieval
  • augmented generation, grounding, or other real-time taking of content for
  • generative AI search answers).
  • ai-train: training or fine-tuning AI models.
  • ANY RESTRICTIONS EXPRESSED VIA CONTENT SIGNALS ARE EXPRESS RESERVATIONS OF
  • RIGHTS UNDER ARTICLE 4 OF THE EUROPEAN UNION DIRECTIVE 2019/790 ON COPYRIGHT
  • AND RELATED RIGHTS IN THE DIGITAL SINGLE MARKET.
  • BEGIN Cloudflare Managed content
  • END Cloudflare Managed Content
  • Allow access to social platforms
  • Specific instructions for Google
  • Specific instructions for Bing
  • Specific instructions for Yandex
  • Sitemap location
  • Crawl-delay for all bots (optional, remove if not needed)
  • Crawl-delay: 5

Warnings

  • `content-signal` is not a known field.