daniel-siepmann.de
robots.txt

Robots Exclusion Standard data for daniel-siepmann.de

Resource Scan

Scan Details

Site Domain daniel-siepmann.de
Base Domain daniel-siepmann.de
Scan Status Ok
Last Scan2024-05-17T17:21:29+00:00
Next Scan 2024-06-16T17:21:29+00:00

Last Scan

Scanned2024-05-17T17:21:29+00:00
URL https://daniel-siepmann.de/robots.txt
Domain IPs 194.36.147.177, 2a03:4000:4d:d84:98f5:87ff:fe5c:1d21
Response IP 194.36.147.177
Found Yes
Hash c5d81724237b61b9ca22b6baecf0a4611f07145af5a1e471e7b9206e0d566a3c
SimHash 70185b40e3a0

Groups

*

Rule Path
Disallow /fileadmin/
Disallow /archive/

adsbot-google
amazonbot
anthropic-ai
applebot
awariorssbot
awariosmartbot
bytespider
ccbot
chatgpt-user
claudebot
claude-web
cohere-ai
dataforseobot
facebookbot
google-extended
googleother
gptbot
imagesiftbot
magpie-crawler
meltwater
omgili
omgilibot
peer39_crawler
peer39_crawler/1.0
perplexitybot
seekr
youbot

Rule Path
Disallow /