starluky.com
robots.txt

Robots Exclusion Standard data for starluky.com

Resource Scan

Scan Details

Site Domain starluky.com
Base Domain starluky.com
Scan Status Ok
Last Scan 2024-09-28T16:43:14+00:00
Next Scan 2024-10-05T16:43:14+00:00

Last Scan

Scanned 2024-09-28T16:43:14+00:00
URL https://starluky.com/robots.txt
Redirect https://www.starluky.com/robots.txt
Redirect Domain www.starluky.com
Redirect Base starluky.com
Domain IPs 104.21.96.6, 172.67.150.28, 2606:4700:3031::6815:6006, 2606:4700:3036::ac43:961c
Redirect IPs 104.21.96.6, 172.67.150.28, 2606:4700:3031::6815:6006, 2606:4700:3036::ac43:961c
Response IP 172.67.150.28
Found Yes
Hash f8dc4c1ae3e38e3e2dcf61a2139a883e77b336dddf5f004859a1174ddf9f8f7c
SimHash c234ffe945f2

Groups

ia_archiver

Rule Path Comment
Allow / Popular chinese search engines User-agent: Baiduspider
Allow / -

baiduspider/2.0

Rule Path
Allow /

baiduspider-video

Rule Path
Allow /

baiduspider-image

Rule Path
Allow /

sogou spider

Rule Path
Allow /

sogou web spider

Rule Path
Allow /

sosospider

Rule Path
Allow /

sosospider+

Rule Path
Allow /

sosospider/2.0

Rule Path
Allow /

yodao

Rule Path
Allow /

youdao

Rule Path
Allow /

youdaobot

Rule Path
Allow /

youdaobot/1.0

Rule Path Comment
Allow / Block Bad Bots. "AI recommended setting" by ChatGPT User-agent: ia_archiver
Disallow / -

archive.org_bot

Rule Path
Disallow /

siteexplorer

Rule Path
Disallow /

spbot

Rule Path
Disallow /

wbsearchbot

Rule Path
Disallow /

linkdexbot

Rule Path
Disallow /

screaming frog seo spider

Rule Path
Disallow /

netestate ne crawler

Rule Path
Disallow /

moreover

Rule Path
Disallow /

sentibot

Rule Path
Disallow /

aboundexbot

Rule Path
Disallow /

proximic

Rule Path
Disallow /

obot

Rule Path
Disallow /

meanpathbot

Rule Path
Disallow /

nutch

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

zoominfobot

Rule Path
Disallow /

zmeu

Rule Path
Disallow /

grapeshot

Rule Path
Disallow /

python-requests

Rule Path
Disallow /

go-http-client

Rule Path
Disallow /

apache-httpclient

Rule Path
Disallow /

libwww-perl

Rule Path
Disallow /

curl

Rule Path
Disallow /

wget

Rule Path Comment
Disallow / ChatGPT Bot Blocker - Block ChatGPT Bot from scrapping your content User-agent: GPTBot
Disallow / Spam Backlink Blocker Disallow: /feed/
Disallow /feed/$ -
Disallow /comments/feed -
Disallow /trackback/ -
Disallow */?author=* -
Disallow */author/* -
Disallow /author* -
Disallow /author/ -
Disallow */comments$ -
Disallow */feed -
Disallow */feed$ -
Disallow */trackback -
Disallow */trackback$ -
Disallow /?feed= -
Disallow /wp-comments -
Disallow /wp-feed -
Disallow /wp-trackback -
Disallow */replytocom%3D Block Bad Bots. Powered by Better Robots.txt Pro User-agent: GiftGhostBot
Disallow / -

seznam

Rule Path
Disallow /

paperlibot

Rule Path
Disallow /

genieo

Rule Path
Disallow /

dataprovider/6.101

Rule Path
Disallow /

dataprovidersiteexplorer

Rule Path
Disallow /

dazoobot/1.0

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

domainstatsbot/1.0

Rule Path
Disallow /

dubaiindex

Rule Path
Disallow /

ecommercebot

Rule Path
Disallow /

expertsearchspider

Rule Path
Disallow /

feedbin

Rule Path
Disallow /

fetch/2.0a

Rule Path
Disallow /

ffbot/1.0

Rule Path
Disallow /

focusbot/1.1

Rule Path
Disallow /

huaweisymantecspider

Rule Path
Disallow /

huaweisymantecspider/1.0

Rule Path
Disallow /

jobdiggerspider

Rule Path
Disallow /

lemurwebcrawler

Rule Path
Disallow /

lipperheylinkexplorer

Rule Path
Disallow /

lssrocketcrawler/1.0

Rule Path
Disallow /

lyt.srv1.5

Rule Path
Disallow /

miadev/0.0.1

Rule Path
Disallow /

najdi.si/3.1

Rule Path
Disallow /

bountiibot

Rule Path
Disallow /

experibot_v1

Rule Path
Disallow /

bixocrawler

Rule Path
Disallow /

bixocrawler testcrawler

Rule Path
Disallow /

crawler4j

Rule Path
Disallow /

crowsnest/0.5

Rule Path
Disallow /

cukbot

Rule Path
Disallow /

dataprovider/6.92

Rule Path
Disallow /

dblbot/1.0

Rule Path
Disallow /

diffbot/0.1

Rule Path
Disallow /

digg deeper/v1

Rule Path
Disallow /

discobot/1.0

Rule Path
Disallow /

discobot/1.1

Rule Path
Disallow /

discobot/2.0

Rule Path
Disallow /

discoverybot/2.0

Rule Path
Disallow /

dlvr.it/1.0

Rule Path
Disallow /

domainstatsbot/1.0

Rule Path
Disallow /

drupact/0.7

Rule Path
Disallow /

ezooms/1.0

Rule Path
Disallow /

fastbot crawler beta 2.0

Rule Path
Disallow /

fastbot crawler beta 4.0

Rule Path
Disallow /

feedly social

Rule Path
Disallow /

feedly/1.0

Rule Path
Disallow /

feedlybot/1.0

Rule Path
Disallow /

feedspot

Rule Path
Disallow /

feedspotbot/1.0

Rule Path
Disallow /

clickagy intelligence bot v2

Rule Path
Disallow /

classbot

Rule Path
Disallow /

cispa vulnerability notification

Rule Path
Disallow /

cirrusexplorer/1.1

Rule Path
Disallow /

checksem/nutch-1.10

Rule Path
Disallow /

catchbot/5.0

Rule Path
Disallow /

catchbot/3.0

Rule Path
Disallow /

catchbot/2.0

Rule Path
Disallow /

catchbot/1.0

Rule Path
Disallow /

camontspider/1.0

Rule Path
Disallow /

buzzbot/1.0

Rule Path
Disallow /

buzzbot

Rule Path
Disallow /

businessseek.biz_spider

Rule Path
Disallow /

bubing

Rule Path
Disallow /

fyberspider/1.3

Rule Path
Disallow /

findlinks/1.1.6-beta5

Rule Path
Disallow /

g2reader-bot/1.0

Rule Path
Disallow /

findlinks/1.1.6-beta6

Rule Path
Disallow /

findlinks/2.0

Rule Path
Disallow /

findlinks/2.0.1

Rule Path
Disallow /

findlinks/2.0.2

Rule Path
Disallow /

findlinks/2.0.4

Rule Path
Disallow /

findlinks/2.0.5

Rule Path
Disallow /

findlinks/2.0.9

Rule Path
Disallow /

findlinks/2.1

Rule Path
Disallow /

findlinks/2.1.5

Rule Path
Disallow /

findlinks/2.1.3

Rule Path
Disallow /

findlinks/2.2

Rule Path
Disallow /

findlinks/2.5

Rule Path
Disallow /

findlinks/2.6

Rule Path
Disallow /

ffbot/1.0

Rule Path
Disallow /

findlinks/1.0

Rule Path
Disallow /

findlinks/1.1.3-beta8

Rule Path
Disallow /

findlinks/1.1.3-beta9

Rule Path
Disallow /

findlinks/1.1.4-beta7

Rule Path
Disallow /

findlinks/1.1.6-beta1

Rule Path
Disallow /

findlinks/1.1.6-beta1 yacy

Rule Path
Disallow /

findlinks/1.1.6-beta2

Rule Path
Disallow /

findlinks/1.1.6-beta3

Rule Path
Disallow /

findlinks/1.1.6-beta4

Rule Path
Disallow /

bixo

Rule Path
Disallow /

bixolabs/1.0

Rule Path
Disallow /

crawlera/1.10.2

Rule Path
Disallow /

dataprovider site explorer

Rule Path Comment
Disallow / Loading Performance for Woocommerce Disallow: /cart/
Disallow /checkout/ -
Disallow /my-account/ -
Disallow /*?orderby=price -
Disallow /*?orderby=rating -
Disallow /*?orderby=date -
Disallow /*?orderby=price-desc -
Disallow /*?orderby=popularity -
Disallow /*?filter -
Disallow /*?orderby=title -
Disallow /*?orderby=desc -
Disallow /*?filter -
Disallow /*add-to-cart%3D* -
Disallow /*add_to_wishlist%3D* -
Disallow /*?paged=&count=* -
Disallow /*?count=* Image Crawlability by search engines User-agent: *
Allow /*.png* -
Allow /*.jpg* -
Allow /*.gif* -
Allow /*.webp* Avoid crawler traps causing crawl budget issues Disallow: /search/
Disallow *?s=* -
Disallow *?p=* -
Disallow *%26p%3D* -
Disallow *%26preview%3D* -
Disallow /search Social Media Crawling User-agent: facebookexternalhit/1.0
Allow / -

facebookexternalhit/1.1

Rule Path
Allow /

facebookplatform/1.0

Rule Path
Allow /

facebot/1.0

Rule Path
Allow /

visionutils/0.2

Rule Path
Allow /

datagnionbot

Rule Path
Allow /

pinterest/0.2

Rule Path Comment
Allow / Allow/Disallow Ads.txt User-agent: *
Allow /ads.txt Allow/Disallow App-ads.txt User-agent: *
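
Several Comment cells in the groups above end in what looks like a User-agent or Disallow line. If that reflects the file itself, with a directive sitting on the same line as a "#" comment (an assumption; the scan does not show the raw file), a parser would discard the directive along with the comment and attach the following rules to the previous group. That would explain why youdaobot/1.0 is listed with both Allow / and Disallow /. A minimal sketch of that behaviour, using a hypothetical parse_groups helper and a made-up sample rather than the scanner's own logic:

```python
def parse_groups(text):
    """Group Allow/Disallow rules under the User-agent lines that precede them."""
    groups = []              # list of (agents, rules) tuples
    agents, rules = [], []
    for raw in text.splitlines():
        # Everything after '#' is a comment, including any directive fused onto it.
        line = raw.split("#", 1)[0].strip()
        if ":" not in line:
            continue
        field, value = (part.strip() for part in line.split(":", 1))
        field = field.lower()
        if field == "user-agent":
            if rules:        # a User-agent line after rules starts a new group
                groups.append((agents, rules))
                agents, rules = [], []
            agents.append(value.lower())
        elif field in ("allow", "disallow") and agents:
            rules.append((field, value))
    if agents:
        groups.append((agents, rules))
    return groups


# Hypothetical excerpt: the third line is a comment that swallowed a User-agent line.
sample = """User-agent: youdaobot/1.0
Allow: /
# Block Bad Bots.User-agent: ia_archiver
Disallow: /
User-agent: archive.org_bot
Disallow: /
"""

groups = parse_groups(sample)
for agents, rules in groups:
    print(agents, rules)
```

In this sample ia_archiver never becomes a group of its own, and the Disallow: / intended for it lands on youdaobot/1.0.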

Warnings

  • 26 invalid lines.
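
The report does not say which 26 lines were rejected or what rule the scanner applied. As a rough illustration only, the sketch below counts lines that are neither blank, nor comments, nor "field: value" directives with a recognised field name. The KNOWN_FIELDS set, the helper names, and the sample text are all assumptions made for the example.

```python
import re

# Field names accepted in this sketch; the scanner's real list is unknown.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap", "crawl-delay"}

# A directive is "<field> : <value>", with optional surrounding whitespace.
LINE_RE = re.compile(r"^\s*([A-Za-z-]+)\s*:\s*(.*?)\s*$")


def classify_line(line: str) -> str:
    """Return 'blank', 'comment', 'directive', or 'invalid' for one line."""
    stripped = line.strip()
    if not stripped:
        return "blank"
    if stripped.startswith("#"):
        return "comment"
    body = stripped.split("#", 1)[0]   # drop a trailing comment, if any
    match = LINE_RE.match(body)
    if match and match.group(1).lower() in KNOWN_FIELDS:
        return "directive"
    return "invalid"


def count_invalid(text: str) -> int:
    """Count the lines that classify_line marks as invalid."""
    return sum(1 for line in text.splitlines() if classify_line(line) == "invalid")


# Made-up sample: the last line lacks the colon after the field name.
sample = "\n".join([
    "User-agent: youdaobot/1.0",
    "Allow: /",
    "# Block Bad Bots.User-agent: ia_archiver",
    "Disallow: /",
    "Disallow /feed/",
])

print(count_invalid(sample))
```

A stricter or looser definition of "invalid" would give a different count, so this is not expected to reproduce the figure of 26 for the scanned file.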