pub.dakloifarwa.de
robots.txt

Robots Exclusion Standard data for pub.dakloifarwa.de

Resource Scan

Scan Details

Site Domain pub.dakloifarwa.de
Base Domain dakloifarwa.de
Scan Status Ok
Last Scan2025-12-21T15:29:46+00:00
Next Scan 2025-12-22T15:29:46+00:00

Last Scan

Scanned2025-12-21T15:29:46+00:00
URL https://pub.dakloifarwa.de/robots.txt
Domain IPs 94.16.112.50
Response IP 94.16.112.50
Found Yes
Hash 3cb91ec906a874fe0bddbd39964833e9e9b0005a571ce0ba48bf0469142b06f2
SimHash b75e5a514086

Groups

addsearchbot
ai2bot
ai2bot-deepresearcheval
ai2bot-dolma
aihitbot
amazon-kendra
amazonbot
amazonbuyforme
andibot
anomura
anthropic-ai
applebot
applebot-extended
atlassian-bot
awario
bedrockbot
bigsur.ai
bravebot
brightbot 1.0
buddybot
bytespider
ccbot
channel3bot
chatglm-spider
chatgpt agent
chatgpt-user
claude-searchbot
claude-user
claude-web
claudebot
cloudflare-autorag
cloudvertexbot
cohere-ai
cohere-training-data-crawler
cotoyogi
crawl4ai
crawlspace
datenbank crawler
deepseekbot
devin
diffbot
duckassistbot
echobot bot
echoboxbot
facebookbot
facebookexternalhit
factset_spyderbot
firecrawlagent
friendlycrawler
gemini-deep-research
google-cloudvertexbot
google-extended
google-firebase
google-notebooklm
googleagent-mariner
googleother
googleother-image
googleother-video
gptbot
iaskbot
iaskspider
iaskspider/2.0
iboubot
icc-crawler
imagesiftbot
imagespider
img2dataset
isscyberriskcrawler
kangaroo bot
klaviyoaibot
kunatocrawler
laion-huggingface-processor
laiondownloader
lcc
linerbot
linguee bot
linkupbot
manus-user
meta-externalagent
meta-externalagent
meta-externalfetcher
meta-externalfetcher
meta-webindexer
mistralai-user
mistralai-user/1.0
mycentralaiscraperbot
netestate imprint crawler
notebooklm
novaact
oai-searchbot
omgili
omgilibot
openai
operator
pangubot
panscient
panscient.com
perplexity-user
perplexitybot
petalbot
phindbot
poggio-citations
poseidon research crawler
qualifiedbot
quillbot
quillbot.com
sbintuitionsbot
scrapy
semrushbot-ocob
semrushbot-swa
shapbot
sidetrade indexer bot
spider
tavilybot
terracotta
thinkbot
tiktokspider
timpibot
twinagent
velenpublicwebcrawler
wardbot
webzio-extended
webzio-extended
wpbot
wrtnbot
yak
yandexadditional
yandexadditionalbot
youbot
zanistabot

Rule Path
Disallow /