lup.be
robots.txt
Robots Exclusion Standard data for lup.be
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | lup.be |
Base Domain | lup.be |
Scan Status | Ok |
Last Scan | 2025-08-12T21:03:59+00:00 |
Next Scan | 2025-08-26T21:03:59+00:00 |
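The two timestamps above can be compared directly to see how far apart the scans are. The following is a minimal sketch; whether the scanner always uses a fixed interval is an assumption, since only this one pair of dates is recorded here.

```python
from datetime import datetime

# Timestamps copied from the Scan Details table.
last_scan = datetime.fromisoformat("2025-08-12T21:03:59+00:00")
next_scan = datetime.fromisoformat("2025-08-26T21:03:59+00:00")

# Difference between the scheduled next scan and the last scan.
interval = next_scan - last_scan
print(interval.days)  # 14
```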
Last Scan
Field | Value |
---|---|
Scanned | 2025-08-12T21:03:59+00:00 |
URL | https://lup.be/robots.txt |
Domain IPs | 162.159.134.42 |
Response IP | 162.159.134.42 |
Found | Yes |
Hash | 2d65443b4395875a86455763e1f1926a7bf6fa511ff4eb5fc72109e308e596e9 |
SimHash | 545c4311d680 |
Groups
*
No rules defined. All paths allowed.
Other Records
Field | Value |
---|---|
crawl-delay | 7 |
*
Rule | Path |
---|---|
Disallow | /wp-admin/ |
Disallow | /? |
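The two Disallow rules above act as path prefixes. The sketch below shows how a crawler falling under the `*` group might test a path against them. It assumes plain prefix matching, which is enough here because neither rule uses `*` or `$` wildcards and no Allow rules are listed. The function name `is_allowed` is hypothetical.

```python
# Disallow prefixes from the "*" group above.
WILDCARD_DISALLOW = ["/wp-admin/", "/?"]

def is_allowed(path, disallow_prefixes):
    """Return True unless the path starts with a disallowed prefix.

    Simplification: no wildcard or Allow handling, only prefix checks.
    """
    return not any(path.startswith(prefix) for prefix in disallow_prefixes)

print(is_allowed("/wp-admin/options.php", WILDCARD_DISALLOW))  # False
print(is_allowed("/?s=test", WILDCARD_DISALLOW))               # False
print(is_allowed("/about/", WILDCARD_DISALLOW))                # True
```

Note that `/?` only blocks query strings attached to the site root; a path such as `/about/?ref=1` does not start with `/?` and stays allowed under this reading.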
ai2bot
ai2bot-dolma
aihitbot
amazonbot
andibot
anthropic-ai
applebot
applebot-extended
brightbot 1.0
bytespider
ccbot
chatgpt-user
claude-searchbot
claude-user
claude-web
claudebot
cohere-ai
cohere-training-data-crawler
cotoyogi
crawlspace
diffbot
duckassistbot
facebookbot
factset_spyderbot
firecrawlagent
friendlycrawler
google-cloudvertexbot
google-extended
googleother
googleother-image
googleother-video
gptbot
iaskspider/2.0
icc-crawler
imagesiftbot
img2dataset
isscyberriskcrawler
kangaroo bot
meta-externalagent
meta-externalagent
meta-externalfetcher
meta-externalfetcher
mistralai-user/1.0
novaact
oai-searchbot
omgili
omgilibot
operator
pangubot
panscient
panscient.com
perplexity-user
perplexitybot
petalbot
phindbot
qualifiedbot
quillbot
quillbot.com
sbintuitionsbot
scrapy
semrushbot-ocob
semrushbot-swa
sidetrade indexer bot
tiktokspider
timpibot
velenpublicwebcrawler
webzio-extended
wpbot
yandexadditional
yandexadditionalbot
youbot
Rule | Path |
---|---|
Disallow | / |
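A crawler decides which group applies by comparing its own product token with the user-agent names of each group, ignoring case, and falls back to the `*` group when nothing matches. The sketch below illustrates that lookup with a small subset of the names listed above. The function name `rules_for` and the token `examplebot` are hypothetical, and the fallback list assumes the `*` group rules shown earlier in this report.

```python
# A few of the user-agent names from the group above (subset for brevity).
BLOCKED_AGENTS = {"gptbot", "claudebot", "ccbot", "perplexitybot", "bytespider"}

# Rules recorded for the "*" group earlier in the report.
WILDCARD_RULES = ["/wp-admin/", "/?"]

def rules_for(product_token):
    """Return the Disallow prefixes that apply to a crawler's product token."""
    if product_token.lower() in BLOCKED_AGENTS:
        return ["/"]           # whole site disallowed for named agents
    return WILDCARD_RULES      # everyone else falls back to the "*" group

print(rules_for("GPTBot"))      # ['/']
print(rules_for("examplebot"))  # ['/wp-admin/', '/?']
```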
aitcsroboti
accoona
admantx
admantx-usaspb
adbeat_bot
aihitbot
amazonbot
arachnophilia
aspiegelbot
awariosmartbot
backdoorbot
backrub
baidu
blexbot
blexbot
becomebot
blowfishi
bomborabot
catchbot
ccbot
cherrypicker
claudebot
clickagy
cliqzbot
coccocbot
converacrawler
contxbot
crowdtanglebot
cyberspyder
dotbot
echoboxbot
emailcollector
exabot
eyeotabot
findlinks
foobot
genieo
geturl
gigabot
grapeshotcrawler
gumgum
httrack
huaweisymantecspider
iascrawler
imagesiftbot
jikespider
jobboerse
java
jyxobot
leikibot
linkscan
linkisbot
linkdexbot
linkfluence.com
livelapbot
mail.ru_bot
mauibot
mazbot
mbcrawler
megaindex.ru
mj12bot
mojeekbot
mtbot/1.1.0i
nerdybot
nimbostratus-bot
ntentbot
offline explorer
onespot-scraperbot
openbot
outclicksbot
paperlibot
perl
petalbot
plurkbot
proximic
proximi
python
quantcastboti
qwantify
scholarbot
scrap
screaming frog seo spider
semantici
sentibot
seokicks
seokicks-robot
serendeputybot
serpstatbot
seznambot
sitecheck-sitecrawl
sitesnagger
snooper
sogou
sosospider
superbot
taboolabot
teleportpro
tkbot
ttd-content
tweetmemebot
urlspiderpro
vagabondo
velenpublicwebcrawler
voilabot
voluumdsp-content-bot
webcopier
weborama-fetcher
webreaper
webstripper
webzip
xaldon_webspider
yak
yandex
yandexbot
yandeximages
zgrab
zoominfobot
scrapy
buck
tinytestbot
semrushbot
ahrefsbot
petalbot
mj12bot
dotbot
mauibot
yandexbot
baiduspider
barkrowler
bytespider
whatstuffwherebot
applebot
sogou pic spider/3.0( http://www.sogou.com/docs/help/webmasters.htm
sogou head spider/3.0( http://www.sogou.com/docs/help/webmasters.htm
sogou web spider/4.0(+http://www.sogou.com/docs/help/webmasters.htm
user-agent: sogou orion spider/3.0( http://www.sogou.com/docs/help/webmasters.htm
sogou-test-spider/4.0 (compatible; msie 5.5; windows 98)
mozilla/5.0 (compatible; konqueror/3.5; linux) khtml/3.5.5 (like gecko) (exabot-thumbnails)
mozilla/5.0 (compatible; exabot/3.0; +http://www.exabot.com/go/robot)
swiftbot
slurp
ccbot/2.0 (https://commoncrawl.org/faq/)
ccbot/2.0
ccbot/2.0 (http://commoncrawl.org/faq/)
Rule | Path |
---|---|
Disallow | / |
Comments