blablog.de
robots.txt

Robots Exclusion Standard data for blablog.de

Resource Scan

Scan Details

Site Domain blablog.de
Base Domain blablog.de
Scan Status Ok
Last Scan2025-06-04T16:48:08+00:00
Next Scan 2025-07-04T16:48:08+00:00

Last Scan

Scanned2025-06-04T16:48:08+00:00
URL https://blablog.de/robots.txt
Domain IPs 212.53.214.210
Response IP 212.53.214.210
Found Yes
Hash 66f2e5f4f4a66470c630ee7b45f3a18f0254663b655471661d1cd6cf91d99847
SimHash 6006495104f1

Groups

*

Rule Path
Disallow /feeds/
Disallow /bundled-libs/
Disallow /deployment/
Disallow /docs/
Disallow /htmlarea/
Disallow /include/
Disallow /lang/
Disallow /plugins/
Disallow /sql/
Disallow /templates_c/
Disallow /*.php$
Disallow /*.js$
Disallow /*.inc$
Disallow /*.tar$
Disallow /*.tgz$
Disallow /*.sh$
Disallow /*.zip$
Disallow /*.tpl$
Disallow /pages/impressum.html
Disallow /pages/datenschutz.html

amazonbot
anthropic-ai
applebot-extended
bytespider
ccbot
chatgpt-user
claudebot
claude-web
cohere-ai
diffbot
facebookbot
friendlycrawler
googlebot-image
google-extended
googleother
googleother-image
googleother-video
gptbot
imagesiftbot
img2dataset
omgili
omgilibot
perplexitybot
youbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://blablog.de/sitemap.xml