cbsc.ca
robots.txt

Robots Exclusion Standard data for cbsc.ca

Resource Scan

Scan Details

Site Domain	cbsc.ca
Base Domain	cbsc.ca
Scan Status	Ok
Last Scan	2025-11-13T20:55:34+00:00
Next Scan	2025-12-13T20:55:34+00:00

Last Scan

Scanned	2025-11-13T20:55:34+00:00
URL	https://cbsc.ca/robots.txt
Domain IPs	192.95.19.213
Response IP	192.95.19.213
Found	Yes
Hash	943477da93ce9b092c670f714a0b48096240c3b92f243d1f5971c5c24674bcf0
SimHash	793cd111c1a0
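
The `Hash` value above is 64 hexadecimal characters, which is consistent with a SHA-256 digest, although the report does not name the algorithm. Assuming SHA-256 over the raw response body, detecting a change between two scans might be sketched as follows. `body_digest` and `has_changed` are hypothetical helper names, and the sample body is illustrative only.

```python
import hashlib


def body_digest(body: bytes) -> str:
    """Return the hex SHA-256 digest of a robots.txt response body.

    Assumption: the scanner hashes the raw bytes with SHA-256; the
    report itself does not say which algorithm it uses.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_digest: str, body: bytes) -> bool:
    """Compare a stored digest with the digest of a freshly fetched body."""
    return body_digest(body) != previous_digest.lower()


if __name__ == "__main__":
    # Illustrative body only, not the real cbsc.ca file.
    sample = b"User-agent: *\nDisallow: /cgi-bin/\n"
    stored = body_digest(sample)
    print(has_changed(stored, sample))         # same bytes
    print(has_changed(stored, sample + b"#"))  # one extra byte
```

An exact hash only says whether the bytes differ; a similarity hash such as the `SimHash` field is the usual way to judge how much they differ.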

Groups

*

Rule	Path
Allow	/decisionsarchive/
Allow	/transcripts/
Allow	/images/uploads/annual_reports
Disallow	/cgi-bin/
Disallow	/images/uploads/membersonly/
Disallow	/images/smileys/
Disallow	/images/signature_attachments/
Disallow	/images/pm_attachments/
Disallow	/images/forum_attachments/
Disallow	/images/avatars/
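
To see how a crawler might interpret the `*` group above, here is a minimal sketch using Python's standard `urllib.robotparser`. The `ROBOTS_TXT` string is reconstructed from the table, not copied from the original file, so it will not reproduce the reported hash. The sample paths are hypothetical. Note that `urllib.robotparser` applies the first matching rule in file order and does not implement `*` wildcards inside paths.

```python
from urllib.robotparser import RobotFileParser

# Reconstruction of the "*" group from the table above (an assumption:
# the real file may differ in ordering, comments, and whitespace).
ROBOTS_TXT = """\
User-agent: *
Allow: /decisionsarchive/
Allow: /transcripts/
Allow: /images/uploads/annual_reports
Disallow: /cgi-bin/
Disallow: /images/uploads/membersonly/
Disallow: /images/smileys/
Disallow: /images/signature_attachments/
Disallow: /images/pm_attachments/
Disallow: /images/forum_attachments/
Disallow: /images/avatars/
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


def is_allowed(parser: RobotFileParser, agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` under the parsed rules."""
    return parser.can_fetch(agent, "https://cbsc.ca" + path)


if __name__ == "__main__":
    rp = build_parser(ROBOTS_TXT)
    # Hypothetical paths chosen to exercise each kind of rule.
    for path in (
        "/transcripts/example.html",
        "/images/uploads/annual_reports/example.pdf",
        "/images/uploads/membersonly/example.pdf",
        "/cgi-bin/example.cgi",
        "/about",
    ):
        print(path, is_allowed(rp, "ExampleBot", path))
```

A path that matches no rule, such as `/about` here, is allowed by default.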

amazonbot

Rule	Path
Disallow	/

anthropic-ai

Rule	Path
Disallow	/

awariorssbot
awariosmartbot

Rule	Path
Disallow	/

bytespider

Rule	Path
Disallow	/

ccbot

Rule	Path
Disallow	/

chatgpt-user

Rule	Path
Disallow	/

claudebot

Rule	Path
Disallow	/

claude-web

Rule	Path
Disallow	/

cohere-ai

Rule	Path
Disallow	/

dataforseobot

Rule	Path
Disallow	/

diffbot

Rule	Path
Disallow	/

facebookbot

Rule	Path
Disallow	/

google-extended

Rule	Path
Disallow	/

gptbot

Rule	Path
Disallow	/

magpie-crawler

Rule	Path
Disallow	/

newsnow

Rule	Path
Disallow	/

news-please

Rule	Path
Disallow	/

omgili

Rule	Path
Disallow	/

omgilibot

Rule	Path
Disallow	/

peer39_crawler
peer39_crawler/1.0

Rule	Path
Disallow	/

perplexitybot

Rule	Path
Disallow	/

scrapy

Rule	Path
Disallow	/

turnitinbot

Rule	Path
Disallow	/

applebot-extended

Rule	Path
Disallow	/

imagesiftbot

Rule	Path
Disallow	/

facebookexternalhit

Rule	Path
Allow	/*?*smid=

twitterbot

Rule	Path
Allow	/*?*smid=
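
The `Allow` path in the `facebookexternalhit` and `twitterbot` groups contains `*` wildcards, which Python's `urllib.robotparser` does not interpret. The sketch below shows one way to match such a pattern, following the common convention (also described in RFC 9309) that `*` matches any run of characters and a trailing `$` anchors the end of the URL. `pattern_to_regex` and `path_matches` are hypothetical helpers, and the sample URLs are made up for illustration.

```python
import re


def pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a compiled regex.

    `*` becomes `.*`; a trailing `$` anchors the end of the path.
    Every other character is matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def path_matches(pattern: str, path_and_query: str) -> bool:
    """Return True if the pattern matches from the start of the path."""
    return pattern_to_regex(pattern).match(path_and_query) is not None


if __name__ == "__main__":
    rule = "/*?*smid="
    # Hypothetical request targets (path plus query string).
    for target in ("/news/item?smid=tw", "/a/b?x=1&smid=fb", "/news/item"):
        print(target, path_matches(rule, target))
```

Under this reading, the rule applies to any URL whose query string contains `smid=`, regardless of the path before the `?`.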

Other Records

Field	Value
sitemap	https://cbsc.ca/sitemap.xml

Comments

Disallow Rules
Other Bot Rules
