nucalm.com
robots.txt

Robots Exclusion Standard data for nucalm.com

Resource Scan

Scan Details

Site Domain nucalm.com
Base Domain nucalm.com
Scan Status Ok
Last Scan2025-07-05T00:41:36+00:00
Next Scan 2025-07-19T00:41:36+00:00

Last Scan

Scanned2025-07-05T00:41:36+00:00
URL https://nucalm.com/robots.txt
Domain IPs 184.169.135.34, 52.8.178.132
Response IP 184.169.135.34
Found Yes
Hash ae3f7186585048555dd257b7922940e00106a1e8ec4d7f90bce405f88d2af00d
SimHash 0a59db54889b

Groups

*

Rule Path
Disallow /nucalm-confidential
Disallow /assets/investment-docs/
Disallow /assets/stockholders-docs/

etaospider

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://nucalm.com/sitemap-index.xml

Comments

  • User-agent: ChatGPT-User
  • Disallow: /
  • User-agent: GPTBot
  • Disallow: /
  • Facebook
  • User-agent: meta-externalagent
  • Disallow: /
  • User-agent: meta-externalfetcher
  • Disallow: /
  • User-agent: facebookexternalhit
  • Disallow: /
  • Gemini / Bard
  • User-agent: Googlebot-extended
  • Disallow: /
  • GoogleAgent-Mariner
  • User-agent: GoogleAgent-Mariner
  • Disallow: /