cbdnorwich.co.uk
robots.txt

Robots Exclusion Standard data for cbdnorwich.co.uk

Resource Scan

Scan Details

Site Domain cbdnorwich.co.uk
Base Domain cbdnorwich.co.uk
Scan Status Ok
Last Scan 2024-09-22T00:29:18+00:00
Next Scan 2024-10-22T00:29:18+00:00

Last Scan

Scanned 2024-09-22T00:29:18+00:00
URL https://cbdnorwich.co.uk/robots.txt
Domain IPs 88.208.199.19
Response IP 88.208.199.19
Found Yes
Hash 5bab56ed6347ed25bcc78dbc336f815eb5637c921e1540cd44c8aa4a405eaa35
SimHash c110730705b4

Groups

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

seekportbot

Rule Path
Disallow /

awariorssbot

Rule Path
Disallow /

awariosmartbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

coccocbot

Rule Path
Disallow /

geedobot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

msnbot

Rule Path
Disallow /

bingbot

Rule Path
Disallow /

sogou

Rule Path
Disallow /

yak

Rule Path
Disallow /

turnitin

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

nuclei

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

crawler_eb_germany_2.0

Rule Path
Disallow /

seekport crawler

Rule Path
Disallow /

adscanner

Rule Path
Disallow /

pimeyes.com crawler

Rule Path
Disallow /

linguee bot

Rule Path
Disallow /

borneobot

Rule Path
Disallow /

seekport crawler

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

idg/uk

Rule Path
Disallow /

crawler4j

Rule Path
Disallow /

pricebot

Rule Path
Disallow /

yandex

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

bl.uk

Rule Path
Disallow /

qwantify

Rule Path
Disallow /

bleriot

Rule Path
Disallow /

aspiegelbot

Rule Path
Disallow /

awariosmartbot

Rule Path
Disallow /

velenpublicwebcrawler

Rule Path
Disallow /

vagabondo

Rule Path
Disallow /

*

Rule Path
Disallow /lib/
Disallow /*.php$
Disallow /pkginfo/
Disallow /report/
Disallow /var/
Disallow /catalog/
Disallow /customer/
Disallow /sendfriend/
Disallow /review/
Disallow /*SID%3D
Disallow /*?
Disallow /checkout/
Disallow /onestepcheckout/
Disallow /customer/
Disallow /customer/account/
Disallow /customer/account/login/
Disallow /catalogsearch/
Disallow /catalog/product_compare/
Disallow /catalog/category/view/
Disallow /catalog/product/view/
Disallow /app/
Disallow /bin/
Disallow /dev/
Disallow /lib/
Disallow /phpserver/
Allow /pub/media/catalog/
Allow /pub/media/favicon/
Allow /pub/media/logo/
Allow /pub/media/wysiwyg/
Disallow /pub/
Disallow /tag/
Disallow /review/
Disallow /composer.json
Disallow /composer.lock
Disallow /CONTRIBUTING.md
Disallow /CONTRIBUTOR_LICENSE_AGREEMENT.html
Disallow /COPYING.txt
Disallow /Gruntfile.js
Disallow /LICENSE.txt
Disallow /LICENSE_AFL.txt
Disallow /nginx.conf.sample
Disallow /package.json
Disallow /php.ini.sample
Disallow /RELEASE_NOTES.txt
Disallow /*?*product_list_mode=
Disallow /*?*product_list_order=
Disallow /*?*product_list_limit=
Disallow /*?*product_list_dir=
Disallow /*.git
Disallow /*.CVS
Disallow /*.Zip$
Disallow /*.Svn$
Disallow /*.Idea$
Disallow /*.Sql$
Disallow /*.Tgz$
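
The wildcard group above mixes Allow and Disallow lines with `*` and `$` patterns, so it can be unclear which rule applies to a given path. The sketch below is one way to evaluate a handful of those rules. It assumes the longest-match convention described in RFC 9309 (the most specific matching pattern wins, Allow wins a tie, and a path with no matching rule is allowed) and uses pattern length as a simple stand-in for specificity. The helper names `pattern_to_regex` and `is_allowed` and the sample paths are hypothetical, and only a subset of the listed rules is included.

```python
import re


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex.

    '*' matches any run of characters; a trailing '$' pins the end of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_allowed(path: str, rules: list[tuple[str, str]]) -> bool:
    """Return True if `path` may be fetched under `rules`.

    Assumption: the longest matching pattern wins, Allow wins ties,
    and a path that matches no rule is allowed.
    """
    best_len = -1
    allowed = True
    for kind, pattern in rules:
        # re.match anchors at the start of the path, as robots.txt rules do.
        if pattern_to_regex(pattern).match(path):
            is_allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len = len(pattern)
                allowed = is_allow
    return allowed


# A subset of the '*' group listed above.
RULES = [
    ("allow", "/pub/media/catalog/"),
    ("disallow", "/pub/"),
    ("disallow", "/*.php$"),
    ("disallow", "/*?"),
    ("disallow", "/checkout/"),
]

if __name__ == "__main__":
    for sample in ["/pub/media/catalog/product/a.jpg", "/pub/static/x.js",
                   "/index.php", "/shop?page=2", "/about-us"]:
        print(sample, "allowed" if is_allowed(sample, RULES) else "blocked")
```

Under these assumptions, Allow /pub/media/catalog/ takes precedence over the shorter Disallow /pub/, which is why the media folders stay crawlable while the rest of /pub/ does not. Real crawlers may differ in how they break ties or handle percent-encoding, so this is an illustration rather than a reference implementation.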

Other Records

Field Value
sitemap https://cbdnorwich.co.uk/sitemap.xml

Comments

  • Enables robots.txt rules for all crawlers
  • Magento sitemap: URL to your sitemap file in Magento
  • Disable checkout & customer account
  • Disable Search pages
  • Disable common folders
  • Disable Tag & Review (Avoid duplicate content)
  • Common files
  • Disable sorting (Avoid duplicate content)
  • Disable version control folders and others