carp.ca
robots.txt

Robots Exclusion Standard data for carp.ca

Resource Scan

Scan Details

Site Domain carp.ca
Base Domain carp.ca
Scan Status Ok
Last Scan2026-02-13T23:57:48+00:00
Next Scan 2026-02-20T23:57:48+00:00

Last Scan

Scanned2026-02-13T23:57:48+00:00
URL https://carp.ca/robots.txt
Redirect https://www.carp.ca/robots.txt
Redirect Domain www.carp.ca
Redirect Base carp.ca
Domain IPs 13.35.238.109, 13.35.238.13, 13.35.238.52, 13.35.238.60, 2600:9000:2085:2a00:1b:5a8d:f7c0:93a1, 2600:9000:2085:3600:1b:5a8d:f7c0:93a1, 2600:9000:2085:3800:1b:5a8d:f7c0:93a1, 2600:9000:2085:7000:1b:5a8d:f7c0:93a1, 2600:9000:2085:8a00:1b:5a8d:f7c0:93a1, 2600:9000:2085:c800:1b:5a8d:f7c0:93a1, 2600:9000:2085:e200:1b:5a8d:f7c0:93a1, 2600:9000:2085:ea00:1b:5a8d:f7c0:93a1
Redirect IPs 13.35.238.109, 13.35.238.13, 13.35.238.52, 13.35.238.60, 2600:9000:2085:1c00:1b:5a8d:f7c0:93a1, 2600:9000:2085:2a00:1b:5a8d:f7c0:93a1, 2600:9000:2085:8200:1b:5a8d:f7c0:93a1, 2600:9000:2085:9a00:1b:5a8d:f7c0:93a1, 2600:9000:2085:9c00:1b:5a8d:f7c0:93a1, 2600:9000:2085:cc00:1b:5a8d:f7c0:93a1, 2600:9000:2085:e600:1b:5a8d:f7c0:93a1, 2600:9000:2085:e800:1b:5a8d:f7c0:93a1
Response IP 13.35.238.52
Found Yes
Hash d85b8b5b00d62973de61a0717468815be9f0d4bc1993b2629649e58facb75ab5
SimHash f819191bc7f1

Groups

*

Rule Path
Disallow /bookmarks/
Disallow /*/upload-photo/

googlebot

Rule Path
Allow /

googlebot-news

Rule Path
Allow /

googlebot-image

Rule Path
Allow /

gptbot

Rule Path
Allow /

amazonbot

Rule Path
Allow /

anthropic-ai

Rule Path
Allow /

applebot-extended

Rule Path
Allow /

awariorssbot
awariosmartbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Allow /

claudebot

Rule Path
Allow /

claude-web

Rule Path
Allow /

cohere-ai

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

facebookbot

Rule Path
Allow /

friendlycrawler

Rule Path
Disallow /

google-cloudvertexbot

Rule Path
Allow /

google-extended

Rule Path
Allow /

gptbot

Rule Path
Allow /

imagesiftbot

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

newsnow

Rule Path
Disallow /

news-please

Rule Path
Disallow /

oai-searchbot

Rule Path
Allow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

peer39_crawler
peer39_crawler/1.0

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

quora-bot

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

oai-searchbot

Rule Path
Allow /

gptbot

Rule Path
Allow /

grok

Rule Path
Allow /

Other Records

Field Value
sitemap https://www.carp.ca/sitemap.xml

Comments

  • ZoomerMedia Limited content is made available for your personal, non-commercial
  • use subject to our Terms of Service here:
  • https://zoomermedia.ca/legal-privacy-policy/.
  • Use of any device, tool, or process designed to data mine or scrape the content
  • using automated means is prohibited without prior written permission from
  • ZoomerMedia Limited. Prohibited uses include but are not limited to:
  • (1) text and data mining activities
  • (2) the development of any software, machine learning, artificial intelligence (AI),
  • and/or large language models (LLMs);
  • (3) creating or providing archived or cached data sets containing our content to others; and/or
  • (4) any commercial purposes.
  • Contact https://zoomermedia.ca/contact-us/ for assistance.
  • site-specific
  • Googlebot Specific Rules
  • Allow Google bots full access
  • Block specific AI training bots
  • User-agent: Googlebot
  • Disallow Rules
  • Other Bot Rules
  • Sitemaps