obermain.de
robots.txt

Robots Exclusion Standard data for obermain.de

Resource Scan

Scan Details

Site Domain obermain.de
Base Domain obermain.de
Scan Status Ok
Last Scan2024-07-05T23:29:28+00:00
Next Scan 2024-07-12T23:29:28+00:00

Last Scan

Scanned2024-07-05T23:29:28+00:00
URL https://www.obermain.de/robots.txt
Domain IPs 82.211.32.82
Response IP 82.211.32.82
Found Yes
Hash 9d1b291a42ead90cf501532b4cdbb0944bb69995db72c8dae441dfa6064f1a0e
SimHash ba3657180df7

Groups

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

Comments

  • ID: 1
  • Legal notice: obermain.de expressly reserves the right to use its content for commercial text and data mining (ยง 44b UrhG).
  • The use of robots or other automated means to access obermain.de or collect or mine data without the express permission of obermain.de is strictly prohibited.
  • If you would like to apply for permission to crawl obermain.de, collect or use data, please contact syndication@mainpost.de
  • OpenAI
  • Google Bard
  • Common Crawl Foundation

Warnings

  • 9 invalid lines.