cleanorigin.com
robots.txt

Robots Exclusion Standard data for cleanorigin.com

Resource Scan

Scan Details

Site Domain cleanorigin.com
Base Domain cleanorigin.com
Scan Status Ok
Last Scan2025-05-04T10:49:34+00:00
Next Scan 2025-06-03T10:49:34+00:00

Last Scan

Scanned2025-05-04T10:49:34+00:00
URL https://cleanorigin.com/robots.txt
Redirect https://www.cleanorigin.com/robots.txt
Redirect Domain www.cleanorigin.com
Redirect Base cleanorigin.com
Domain IPs 172.66.41.23, 172.66.42.233, 2606:4700:3108::ac42:2917, 2606:4700:3108::ac42:2ae9
Redirect IPs 172.66.41.23, 172.66.42.233, 2606:4700:3108::ac42:2917, 2606:4700:3108::ac42:2ae9
Response IP 172.66.41.23
Found Yes
Hash f493061c79aabd04b1ec6cdd350f7cf286ccc70f33b9ede0a1b93293ca2a6d71
SimHash 5bf4914ade72

Groups

*

Rule Path
Allow /*.js
Allow /*.css
Allow /*?ver=
Disallow /index.php/
Disallow /*?
Disallow /checkout/
Disallow /app/
Disallow /bin/
Disallow /dev/
Disallow /phpserver/
Disallow /pub/
Disallow /lib/
Disallow /*.php$
Disallow /pkginfo/
Disallow /report/
Disallow /var/
Disallow /catalog/
Disallow /customer/
Disallow /customer/account/
Disallow /customer/account/login/
Disallow /checkout/
Disallow /sendfriend/
Disallow /review/
Disallow /*SID%3D
Disallow /turpentine/
Disallow /sales/order/history/
Disallow /ringbuilder/index/createbundledirect/
Disallow /product/builder/create/
Disallow /no-route/
Disallow /en_us/
Disallow /en_gb/customer/account/
Disallow /en_ca/customer/account/
Disallow /en_au/customer/account/
Disallow /fastlyCdn/*
Disallow /en_ca/fastlyCdn/*
Disallow /en_au/fastlyCdn/*
Disallow /en_gb/fastlyCdn/*
Disallow /blog/author/
Disallow /blog/wp-content/

googlebot

Rule Path
Disallow
Disallow /index.php/
Disallow /*?
Disallow /checkout/
Disallow /app/
Disallow /bin/
Disallow /dev/
Disallow /phpserver/
Disallow /pub/
Disallow /lib/
Disallow /*.php$
Disallow /pkginfo/
Disallow /report/
Disallow /var/
Disallow /catalog/
Disallow /customer/
Disallow /customer/account/
Disallow /customer/account/login/
Disallow /checkout/
Disallow /sendfriend/
Disallow /review/
Disallow /*SID%3D
Disallow /turpentine/
Disallow /sales/order/history/
Disallow /ringbuilder/index/createbundledirect/
Disallow /product/builder/create/
Disallow /no-route/
Disallow /en_us/
Disallow /en_gb/customer/account/
Disallow /en_ca/customer/account/
Disallow /en_au/customer/account/
Disallow /blog/author/
Disallow /blog/wp-content/

googlebot-mobile

Rule Path
Allow /*.js
Allow /*.css

googlebot-image

Rule Path
Disallow

mozbot

Rule Path
Disallow /index.php/
Disallow /*?
Disallow /checkout/
Disallow /app/
Disallow /bin/
Disallow /dev/
Disallow /phpserver/
Disallow /pub/
Disallow /lib/
Disallow /*.php$
Disallow /pkginfo/
Disallow /report/
Disallow /var/
Disallow /catalog/
Disallow /customer/
Disallow /customer/account/
Disallow /customer/account/login/
Disallow /checkout/
Disallow /sendfriend/
Disallow /review/
Disallow /*SID%3D
Disallow /turpentine/
Disallow /sales/order/history/
Disallow /ringbuilder/index/createbundledirect/
Disallow /product/builder/create/
Disallow /no-route/
Disallow /en_us/
Disallow /en_gb/customer/account/
Disallow /en_ca/customer/account/
Disallow /en_au/customer/account/
Disallow /blog/author/
Disallow /blog/wp-content/

*

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20

sitebulb

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 2

pinterestbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

archive.org_bot

Rule Path
Disallow /

yahoo! slurp

Rule Path
Disallow /

aranhabot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

yandex

Rule Path
Disallow /

grapeshotcrawler

Rule Path
Disallow /

gluten free crawler

Rule Path
Disallow /

flipboardproxy

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

lexxebot/1.0

Rule Path
Disallow /

obot

Rule Path
Disallow /

sputnikbot

Rule Path
Disallow /

symfony spider

Rule Path
Disallow /

obot

Rule Path
Disallow /

symfony spider

Rule Path
Disallow /

easouspider

Rule Path
Disallow /

panscient.com

Rule Path
Disallow /

lexxebot/1.0

Rule Path
Disallow /

nextgensearchbot

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

sitebot/0.1

Rule Path
Disallow /

sitebot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

crystalsemanticsbot

Rule Path
Disallow /

crystalsemanticsbot

Rule Path
Disallow /

netseer crawler

Rule Path
Disallow /

trovitbot

Rule Path
Disallow /

lexxebot

Rule Path
Disallow /

sosospider

Rule Path
Disallow /

sosospider+(+http://help.soso.com/webspider.htm)

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

ezooms

Rule Path
Disallow /

discobot

Rule Path
Disallow /

jyxobot

Rule Path
Disallow /

sogou

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

sogou web spider/4.0(+http://www.sogou.com/docs/help/webmasters.htm

Product Comment
sogou web spider/4.0(+http://www.sogou.com/docs/help/webmasters.htm 07)
Rule Path
Disallow /

sistrix

Rule Path
Disallow /

heritrix

Rule Path
Disallow /

garlikcrawler/1.1 (http://garlik.com/, crawler@garlik.com)

Rule Path
Disallow /

nerdbynature.bot

Rule Path
Disallow /

psbot

Rule Path
Disallow /

wbsearchbot

Rule Path
Disallow /

addthis.com

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

proximic

Rule Path
Disallow /

discoverybot

Rule Path
Disallow /

bl.uk_lddc_bot

Rule Path
Disallow /

istellabot

Rule Path
Disallow /

seokicks

Rule Path
Disallow /

seokicks-robot

Rule Path
Disallow /

unisterbot

Rule Path
Disallow /

bender

Rule Path
Disallow /

wotbox

Rule Path
Disallow /

jikespider

Rule Path
Disallow /

netestate ne crawler

Rule Path
Disallow /

exabot

Rule Path
Disallow /

pixray-seeker

Rule Path
Disallow /

linguee

Rule Path
Disallow /

integromedb

Rule Path
Disallow /

searchmetricsbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

bdcbot

Rule Path
Disallow /

grapeshotcrawler

Rule Path
Disallow /

wesee:search

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

admantx

Rule Path
Disallow /

spbot

Rule Path
Disallow /

discobot

Rule Path
Disallow /

mail.ru_bot

Rule Path
Disallow /

dot-bot

Rule Path
Disallow /

btt agent sfoaws

Rule Path
Disallow /

productadsbot

Rule Path
Disallow /

addthis

Rule Path
Disallow /

python-requests

Rule Path
Disallow /

viralvideochart

Rule Path
Disallow /

apache-httpclient

Rule Path
Disallow /

fatbot

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

feedburner

Rule Path
Disallow /

gwpimages

Rule Path
Disallow /

xovibot

Rule Path
Disallow /

tbot-nutch

Rule Path
Disallow /

grapeshotcrawler

Rule Path
Disallow /

typhoeus

Rule Path
Disallow /

android

Rule Path
Disallow /

ning

Rule Path
Disallow /

mediatoolkitbot

Rule Path
Disallow /

stores.pl

Rule Path
Disallow /

chimpfeedr.com

Rule Path
Disallow /

genieo

Rule Path
Disallow /

ravencrawler

Rule Path
Disallow /

crazywebcrawler

Rule Path
Disallow /

commoncrawler

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

r6_commentreader

Rule Path
Disallow /

kyoto-tohoku-crawler

Rule Path
Disallow /

icc-crawler

Rule Path
Disallow /

y!j-asr

Rule Path
Disallow /

proximic

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

worldclient.dll

Rule Path
Disallow /

ltx71

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

alexabot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.cleanorigin.com/blog/sitemap_index.xml
sitemap https://www.cleanorigin.com/pub/media/sitemap.xml