main-ding.de
robots.txt

Robots Exclusion Standard data for main-ding.de

Resource Scan

Scan Details

Site Domain main-ding.de
Base Domain main-ding.de
Scan Status Ok
Last Scan 2024-07-06T17:40:26+00:00
Next Scan 2024-07-13T17:40:26+00:00

Last Scan

Scanned 2024-07-06T17:40:26+00:00
URL https://www.main-ding.de/robots.txt
Domain IPs 82.211.32.82
Response IP 82.211.32.82
Found Yes
Hash 257eedd4a2a5f97764f030dc735207aa04637afa3192b93e6d80fd6a02d019af
SimHash 2a1b3e1838b5

Groups

*

Rule Path
Disallow /admin/
Disallow /_/ecards.html*
Disallow /login/
Disallow /shoutbox/
Disallow /intern/
Disallow /ted/
Disallow /ads/
Disallow /neun7alt/
Disallow /abfall/
Disallow /ads/
Disallow /external/
Disallow /bayerndeal/
Disallow /leoadressen/
Disallow /leoevtman/
Disallow /leoartikel/
Disallow /leokinoadressen/
Disallow /fotoserie/
Disallow /member/
Disallow /register/
Disallow /videos/
Disallow /myhonkytonksw/
Disallow /myhonkytonkwue/
Disallow /myhonkytonk/
Disallow /ps_test/
Disallow /fcms/
Disallow /*?_FRAME=33&_FORMAT=PRINT$
Disallow /*?_FRAME=*
Disallow /_/tools/diaview.html?prev=true*
Disallow /_/tools/pdfpage.html
Disallow /_/tools/pdfpage.html*
Disallow /_/tools/pdfpage.html?arid=*
Disallow /*admin*/
Disallow /_/sendmail.html*
Disallow /_/tools/bb_redirect.html*
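
Several of the paths above use the wildcard characters `*` and `$`. The following is a minimal sketch of how such patterns are commonly interpreted (`*` matches any run of characters, a trailing `$` anchors the end of the URL), assuming the convention described in RFC 9309. The helper names `pattern_to_regex` and `is_disallowed` are hypothetical, and individual crawlers may interpret these rules differently.

```python
import re

# A few wildcard rules copied from the "*" group above.
WILDCARD_RULES = [
    "/*?_FRAME=33&_FORMAT=PRINT$",
    "/*?_FRAME=*",
    "/*admin*/",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    Assumed semantics: '*' matches any sequence of characters and a
    trailing '$' anchors the match at the end of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each '*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str, patterns: list[str]) -> bool:
    """Return True if any pattern matches from the start of the path."""
    return any(pattern_to_regex(p).match(path) for p in patterns)
```

Under these assumptions, `is_disallowed("/news/a.html?_FRAME=33&_FORMAT=PRINT", WILDCARD_RULES)` is true, while a plain path such as `/news/index.html` matches none of the three patterns.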

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

ccbot

Rule Path
Disallow /
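
The effect of the groups above can be checked with Python's standard `urllib.robotparser`. This is a sketch only: the `ROBOTS_LINES` excerpt is a hand-written reconstruction of a few rules from this report, not the site's actual file, and `ExampleBot` is a hypothetical user agent. Wildcard rules are left out because the standard-library parser treats paths as literal prefixes.

```python
from urllib.robotparser import RobotFileParser

# Hand-written excerpt reconstructed from the groups listed above
# (an assumption; the real robots.txt contains more rules).
ROBOTS_LINES = """\
User-agent: *
Disallow: /admin/
Disallow: /login/

User-agent: gptbot
Disallow: /

User-agent: ccbot
Disallow: /
""".splitlines()


def build_parser(lines: list[str]) -> RobotFileParser:
    """Parse robots.txt lines without making a network request."""
    parser = RobotFileParser()
    parser.parse(lines)
    return parser


parser = build_parser(ROBOTS_LINES)

# GPTBot has its own group that disallows the entire site.
gptbot_allowed = parser.can_fetch("GPTBot", "https://www.main-ding.de/")

# A crawler without its own group falls back to the "*" group.
admin_allowed = parser.can_fetch("ExampleBot", "https://www.main-ding.de/admin/page.html")
index_allowed = parser.can_fetch("ExampleBot", "https://www.main-ding.de/index.html")
```

Here `gptbot_allowed` and `admin_allowed` evaluate to `False`, and `index_allowed` evaluates to `True`, since only the listed prefixes are blocked for unnamed crawlers.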

Comments

  • ID: 1
  • Legal notice: main-ding.de expressly reserves the right to use its content for commercial text and data mining (§ 44b UrhG).
  • The use of robots or other automated means to access main-ding.de or collect or mine data without the express permission of main-ding.de is strictly prohibited.
  • If you would like to apply for permission to crawl main-ding.de, collect or use data, please contact syndication@mainpost.de
  • OpenAI
  • Google Bard
  • Common Crawl Foundation