Robots Exclusion Standard data for wsmith.ca

Resource Scan

Scan Details

Site Domain wsmith.ca
Base Domain wsmith.ca
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't connect to server.
Last Scan 2025-09-21T02:34:55+00:00
Next Scan 2025-12-20T02:34:55+00:00

Last Successful Scan

Scanned 2025-03-02T01:38:33+00:00
URL https://wsmith.ca/robots.txt
Redirect https://www.wsmith.ca/robots.txt
Redirect Domain www.wsmith.ca
Redirect Base wsmith.ca
Domain IPs 45.33.7.253
Redirect IPs 45.33.7.253
Response IP 45.33.7.253
Found Yes
Hash f391f0304c7b42b370d18c0bda6de049f844281591ed509ded809cfc0402713d
SimHash 6a14d066f697

Groups

aa-site-audit-crawler

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

mauibot

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

ravencrawler

Rule Path
Disallow /

seekportbot

Rule Path
Disallow /

seekport crawler

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

slurp

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

*

Rule Path
Disallow /account
Disallow /cart
Disallow /checkout
Disallow /jewelrybox
Disallow /newsletter
Disallow /search

Other Records

Field Value
crawl-delay 10
sitemap https://www.wsmith.ca/sitemap.xml
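
As a rough illustration of how the groups and records listed here would be evaluated by a crawler, the sketch below rebuilds a small part of the file and checks it with Python's standard urllib.robotparser. The robots.txt text is a partial reconstruction from this report (only two of the fully blocked agents are included, and the original layout and casing are not known), and ExampleBot is a hypothetical agent name used to exercise the * group. The crawl-delay record is left out because the report lists it outside any group, and parsers differ in how they treat an ungrouped crawl-delay.

```python
from urllib.robotparser import RobotFileParser

# Partial reconstruction from the report above. The original file's exact
# layout, casing, and comments are not available, so this is an assumption.
ROBOTS_TXT = """\
User-agent: gptbot
Disallow: /

User-agent: claudebot
Disallow: /

User-agent: *
Disallow: /account
Disallow: /cart
Disallow: /checkout
Disallow: /jewelrybox
Disallow: /newsletter
Disallow: /search

Sitemap: https://www.wsmith.ca/sitemap.xml
"""

BASE_URL = "https://www.wsmith.ca"


def build_parser(text: str = ROBOTS_TXT) -> RobotFileParser:
    """Parse robots.txt text held in memory (no network access)."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


def is_allowed(agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` under the rules above."""
    return build_parser().can_fetch(agent, BASE_URL + path)


if __name__ == "__main__":
    # "ExampleBot" is a hypothetical agent that matches only the * group.
    for agent, path in [("GPTBot", "/"), ("ExampleBot", "/"), ("ExampleBot", "/cart")]:
        print(agent, path, is_allowed(agent, path))
    print(build_parser().site_maps())
```

Agents with their own group (such as gptbot) are blocked from every path, while any other agent falls through to the * group and is blocked only from the listed path prefixes.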

Comments

  • Slow down bots
  • Disable certain pages