cfcdn.aerzteblatt.de
robots.txt

Robots Exclusion Standard data for cfcdn.aerzteblatt.de

Resource Scan

Scan Details

Site Domain cfcdn.aerzteblatt.de
Base Domain aerzteblatt.de
Scan Status Ok
Last Scan 2024-11-02T20:57:46+00:00
Next Scan 2024-11-16T20:57:46+00:00

Last Scan

Scanned 2024-11-02T20:57:46+00:00
URL https://cfcdn.aerzteblatt.de/robots.txt
Domain IPs 108.157.254.120, 108.157.254.125, 108.157.254.15, 108.157.254.64, 2600:9000:2816:1a00:1a:72d5:5180:93a1, 2600:9000:2816:3e00:1a:72d5:5180:93a1, 2600:9000:2816:6e00:1a:72d5:5180:93a1, 2600:9000:2816:9a00:1a:72d5:5180:93a1, 2600:9000:2816:c200:1a:72d5:5180:93a1, 2600:9000:2816:da00:1a:72d5:5180:93a1, 2600:9000:2816:ea00:1a:72d5:5180:93a1, 2600:9000:2816:fa00:1a:72d5:5180:93a1
Response IP 108.157.254.120
Found Yes
Hash d34ad8c8ec2c6161b0c88a6c36951c6bc1b9dc9346805b9f79d46a2ad0b642b1
SimHash 62301c324d6f

Groups

*

Rule Path
Disallow /nachrichten/*?*page=*
Disallow /archiv/*?*page=*
Disallow /werbung/click.asp*
Disallow /*.pdf
Disallow /old/*
Disallow /cms/*
Disallow /intern/*
Disallow /callback/*
Disallow /suche*
Disallow /treffer*
Disallow /archiv/suche*
Disallow /archiv/treffer*
Disallow /anzeigen/
Allow /anzeigen/praxis-abgabe
Allow /anzeigen/praxis-gesuche
Allow /anzeigen/assoziationen
Allow /anzeigen/praxis-niederlassung
Allow /anzeigen/praxis-raeume
Allow /anzeigen/praxis-einrichtung-bedarf
Allow /anzeigen/praxis-ausland
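
The rules above mix wildcard Disallow patterns with more specific Allow exceptions. The following is a minimal sketch of how a crawler might evaluate them, assuming the longest-match precedence described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie). The function names and the sample paths in the usage note are hypothetical and not taken from the scan.

```python
import re

# Rules of the "*" group, copied from the listing above.
RULES = [
    ("disallow", "/nachrichten/*?*page=*"),
    ("disallow", "/archiv/*?*page=*"),
    ("disallow", "/werbung/click.asp*"),
    ("disallow", "/*.pdf"),
    ("disallow", "/old/*"),
    ("disallow", "/cms/*"),
    ("disallow", "/intern/*"),
    ("disallow", "/callback/*"),
    ("disallow", "/suche*"),
    ("disallow", "/treffer*"),
    ("disallow", "/archiv/suche*"),
    ("disallow", "/archiv/treffer*"),
    ("disallow", "/anzeigen/"),
    ("allow", "/anzeigen/praxis-abgabe"),
    ("allow", "/anzeigen/praxis-gesuche"),
    ("allow", "/anzeigen/assoziationen"),
    ("allow", "/anzeigen/praxis-niederlassung"),
    ("allow", "/anzeigen/praxis-raeume"),
    ("allow", "/anzeigen/praxis-einrichtung-bedarf"),
    ("allow", "/anzeigen/praxis-ausland"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any character sequence; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(url_path, rules=RULES):
    """Return True if url_path (path plus optional query) may be crawled."""
    best_len, best_allow = -1, True  # no matching rule means allowed
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(url_path):
            allow = kind == "allow"
            # Longest pattern wins; on equal length, prefer Allow.
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow
```

Under these assumptions, `is_allowed("/anzeigen/praxis-abgabe")` is true because the Allow pattern is longer than `/anzeigen/`, while a hypothetical path such as `/anzeigen/stellenmarkt` or `/nachrichten/123?page=2` is refused.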

Other Records

Field Value
crawl-delay 1

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

ccbot/1.0

Rule Path
Disallow /

ccbot/2.0

Rule Path
Disallow /

ccbot/3.0

Rule Path
Disallow /
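
The per-agent groups above can be checked with Python's standard `urllib.robotparser`. This is only a sketch: the `ROBOTS_TXT` text is an assumed reconstruction from the scan listing (the original file's line order and letter casing are not shown here), and `ExampleBot` is a hypothetical crawler name. The standard-library parser matches paths by prefix and does not expand `*` wildcards in paths, so the path rules of the `*` group are left out and only its crawl-delay is kept.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the agent-specific groups from the listing.
ROBOTS_TXT = """\
User-agent: *
Crawl-delay: 1

User-agent: ChatGPT-User
Disallow: /

User-agent: GPTBot
Disallow: /

User-agent: Google-Extended
Disallow: /

User-agent: CCBot
Disallow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def may_fetch(user_agent, url="https://cfcdn.aerzteblatt.de/"):
    """Return True if the parsed rules let user_agent fetch url."""
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for agent in ("GPTBot", "ChatGPT-User", "Google-Extended",
                  "CCBot/2.0", "ExampleBot"):
        print(agent, may_fetch(agent))
    print("crawl-delay for ExampleBot:", parser.crawl_delay("ExampleBot"))
```

The parser compares only the product token before the `/`, so a versioned name such as `CCBot/2.0` falls under the `CCBot` group; agents without a group of their own fall back to `*`.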

Other Records

Field Value
sitemap https://www.aerzteblatt.de/sitemaps/dae.xml

Comments

  • OpenAI ChatGPT
  • Google Bard
  • Common Crawl
  • Legal notice: aerzteblatt.de expressly reserves the right to use its content for commercial text and data mining (§ 44b UrhG).
  • The use of robots or other automated means to access aerzteblatt.de or collect or mine data without the express permission of aerzteblatt.de is strictly prohibited.
  • aerzteblatt.de may, in its discretion, permit certain automated access to certain aerzteblatt.de pages.
  • If you would like to apply for permission to crawl aerzteblatt.de, collect or use data, please email aerzteblatt@aerzteblatt.de