daniel-siepmann.de
robots.txt
Robots Exclusion Standard data for daniel-siepmann.de
Resource Scan
Scan Details
Site Domain | daniel-siepmann.de |
Base Domain | daniel-siepmann.de |
Scan Status | Ok |
Last Scan | 2024-05-17T17:21:29+00:00 |
Next Scan | 2024-06-16T17:21:29+00:00 |
Last Scan
Scanned | 2024-05-17T17:21:29+00:00 |
URL | https://daniel-siepmann.de/robots.txt |
Domain IPs | 194.36.147.177, 2a03:4000:4d:d84:98f5:87ff:fe5c:1d21 |
Response IP | 194.36.147.177 |
Found | Yes |
Hash | c5d81724237b61b9ca22b6baecf0a4611f07145af5a1e471e7b9206e0d566a3c |
SimHash | 70185b40e3a0 |
Groups
*
Rule | Path |
---|---|
Disallow | /fileadmin/ |
Disallow | /archive/ |
adsbot-google
amazonbot
anthropic-ai
applebot
awariorssbot
awariosmartbot
bytespider
ccbot
chatgpt-user
claudebot
claude-web
cohere-ai
dataforseobot
facebookbot
google-extended
googleother
gptbot
imagesiftbot
magpie-crawler
meltwater
omgili
omgilibot
peer39_crawler
peer39_crawler/1.0
perplexitybot
seekr
youbot
Rule | Path |
---|---|
Disallow | / |