hcm-magazin.de
robots.txt

Robots Exclusion Standard data for hcm-magazin.de

Resource Scan

Scan Details

Site Domain hcm-magazin.de
Base Domain hcm-magazin.de
Scan Status Ok
Last Scan2024-05-24T14:12:58+00:00
Next Scan 2024-05-31T14:12:58+00:00

Last Scan

Scanned2024-05-24T14:12:58+00:00
URL https://hcm-magazin.de/robots.txt
Domain IPs 87.106.113.225
Response IP 87.106.113.225
Found Yes
Hash c51e249ac2b2913a700c8be2c96e9990c394dd78535335d538ca08a102a5ea7d
SimHash b3224d026dd3

Groups

*

Rule Path
Disallow /*search%3D*

gptbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

*

Rule Path
Disallow */download/magazines/Heftarchiv_*.pdf

ahrefsbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

ezooms

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

Comments

  • robots.txt
  • Suche raus
  • Heftarchiv-PDFs raus
  • Legal notice: hcm-magazin.de expressly reserves the right to use its content for commercial text and data mining (ยง 44 b UrhG).
  • The use of robots or other automated means to access hcm-magazin.de or collect or mine data without the express permission of hcm-magazin.de is strictly prohibited.
  • If you would like to apply for permission to crawl hcm-magazin.de, collect or use data, please email produktmanager@holzmann-medien.de