infothek-gesundheit.de
robots.txt

Robots Exclusion Standard data for infothek-gesundheit.de

Resource Scan

Scan Details

Site Domain infothek-gesundheit.de
Base Domain infothek-gesundheit.de
Scan Status Ok
Last Scan 2025-04-25T06:38:14+00:00
Next Scan 2025-05-25T06:38:14+00:00

Last Scan

Scanned 2025-04-25T06:38:14+00:00
URL https://infothek-gesundheit.de/robots.txt
Domain IPs 193.26.157.84, 2a03:4000:4c:27:98dc:20ff:fe69:155b
Response IP 193.26.157.84
Found Yes
Hash 69c8287bc3f5964c01d66656d7052808f6685de90dff098f0d0d252fac7afa31
SimHash 48cc6bc0ed06

Groups

*

Rule Path
Allow /
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /README.txt
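
The rules in the "*" group overlap (for example, "Allow /" and "Disallow /wp-admin/" both match paths under /wp-admin/), so the outcome depends on precedence. The following is a minimal sketch, assuming the longest-match precedence described in RFC 9309, where the most specific matching path wins and an Allow wins a tie. The names RULES and is_allowed are hypothetical, and the sketch handles plain path prefixes only (no "*" or "$" patterns).

```python
# Rules from the "*" group above, as (directive, path) pairs.
RULES = [
    ("allow", "/"),
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("disallow", "/README.txt"),
]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence.

    Assumption: plain prefix matching only; wildcard patterns are not handled.
    """
    best_len = -1
    allowed = True  # a path matched by no rule is allowed by default
    for directive, prefix in rules:
        if not path.startswith(prefix):
            continue
        longer = len(prefix) > best_len
        tie_for_allow = len(prefix) == best_len and directive == "allow"
        if longer or tie_for_allow:
            best_len = len(prefix)
            allowed = directive == "allow"
    return allowed


if __name__ == "__main__":
    for p in ["/", "/wp-admin/options.php", "/wp-admin/admin-ajax.php", "/README.txt"]:
        print(p, is_allowed(p))
```

Under these assumptions, /wp-admin/admin-ajax.php stays crawlable even though the rest of /wp-admin/ is blocked, because its Allow path is longer than the Disallow path.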

claude

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

amazon-kendra-web-crawler-*

Product Comment
amazon-kendra-web-crawler-* all customers of Amazon Kendra's web crawler
Rule Path Comment
Disallow / disallow everything

imagesiftbot

Rule Path
Disallow /

zoominfobot

Rule Path
Disallow /

morningscore bot

Rule Path
Disallow /

semrushbot-si

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

mail.ru_bot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /
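
Each named group above carries a single "Disallow /" rule, so a crawler that matches one of them is blocked from the whole site, while any other crawler falls back to the "*" group. The following is a minimal sketch of that group selection, assuming case-insensitive matching of the crawler's product token. The names BLOCKED_GROUPS, group_for and is_fully_blocked are hypothetical. The "amazon-kendra-web-crawler-*" entry is kept as a literal string and is not expanded as a pattern, since how a given crawler interprets it is not stated in the scan.

```python
# Group names copied from the scan; every one of them has the rule "Disallow /".
BLOCKED_GROUPS = {
    "claude",
    "claudebot",
    "amazon-kendra-web-crawler-*",  # kept literal; not expanded as a wildcard
    "imagesiftbot",
    "zoominfobot",
    "morningscore bot",
    "semrushbot-si",
    "semrushbot",
    "mail.ru_bot",
    "blexbot",
    "mj12bot",
}


def group_for(product_token):
    """Return the group a crawler falls into, or "*" if no named group matches."""
    token = product_token.strip().lower()
    return token if token in BLOCKED_GROUPS else "*"


def is_fully_blocked(product_token):
    """True if the crawler matches a named group, all of which disallow "/"."""
    return group_for(product_token) != "*"


if __name__ == "__main__":
    for name in ["ClaudeBot", "SemrushBot", "Googlebot"]:
        print(name, group_for(name), is_fully_blocked(name))
```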