die-glocke.de
robots.txt

Robots Exclusion Standard data for die-glocke.de

Resource Scan

Scan Details

Site Domain die-glocke.de
Base Domain die-glocke.de
Scan Status Ok
Last Scan2024-09-27T02:39:22+00:00
Next Scan 2024-10-04T02:39:22+00:00

Last Scan

Scanned2024-09-27T02:39:22+00:00
URL https://die-glocke.de/robots.txt
Redirect https://www.die-glocke.de/robots.txt
Redirect Domain www.die-glocke.de
Redirect Base die-glocke.de
Domain IPs 83.223.64.122
Redirect IPs 83.223.64.122
Response IP 83.223.64.122
Found Yes
Hash acb1fb66bad63e76b4c7dcd1f121ce0660b14ffa1c02f43d0b10456d53d9473c
SimHash e2121958adb1

Groups

*

Rule Path
Allow /

*

Rule Path
Disallow /*?tx_kesearch_pi1

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.die-glocke.de/rss/google-news
sitemap https://www.die-glocke.de/sitemap.xml?sitemap=pages

Comments

  • Legal notice: die-glocke.de expressly reserves the right to use its content for commercial text and data mining (§ 44b UrhG).
  • The use of robots or other automated means to access die-glocke.de or collect or mine data without the express permission of die-glocke.de is strictly prohibited.
  • If you would like to apply for permission to crawl die-glocke.de, collect or use data, please contact redaktionsleitung@die-glocke.de