min-ahles-nest.de
robots.txt

Robots Exclusion Standard data for min-ahles-nest.de

Resource Scan

Scan Details

Site Domain min-ahles-nest.de
Base Domain min-ahles-nest.de
Scan Status Ok
Last Scan 2024-06-23T17:22:14+00:00
Next Scan 2024-06-30T17:22:14+00:00
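
The two timestamps in the scan details are exactly one week apart, which suggests a weekly rescan schedule, although the report does not state the interval. A minimal sketch that checks the difference, using only the two values shown above (the variable names are illustrative):

```python
from datetime import datetime

# Timestamps copied from the scan details.
last_scan = datetime.fromisoformat("2024-06-23T17:22:14+00:00")
next_scan = datetime.fromisoformat("2024-06-30T17:22:14+00:00")

# Difference between the scheduled next scan and the last scan.
interval = next_scan - last_scan
print(interval.days)  # 7
```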

Last Scan

Scanned 2024-06-23T17:22:14+00:00
URL http://min-ahles-nest.de/robots.txt
Domain IPs 91.234.171.66
Response IP 91.234.171.66
Found Yes
Hash 0468047997c5e284047d17c955e4f642eb3853434ce4a32a7a67b037d0e94317
SimHash 23211f5c0f35
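
The Hash value has 64 hexadecimal digits, the length of a SHA-256 digest, but the report does not name the algorithm or say which bytes were hashed. A minimal sketch, assuming SHA-256 over the raw response body; the function name `body_digest` and the sample body are hypothetical and will not reproduce the value above:

```python
import hashlib

def body_digest(body: bytes) -> str:
    """Return the hex SHA-256 digest of a robots.txt body (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()

# Hypothetical sample body, not the file that was actually scanned.
sample = b"User-agent: *\nDisallow: /test/\n"
digest = body_digest(sample)
print(len(digest))  # 64
```

Comparing digests between scans is a cheap way to detect that the file changed at all. The shorter SimHash is a similarity fingerprint and is not computed this way.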

Groups

*

Rule Path
Disallow /lightweight-ajax
Disallow /*?trafficsource
Disallow /suche/
Disallow /*?cmp=defrss
Disallow /test/
Disallow /hna-sieben/
Disallow /fdn/bootstrap/
Disallow /bi/bootstrap/
Disallow /bi/doop/
Disallow /sso/
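
Two of the paths in this group contain a `*` wildcard. The sketch below shows how such patterns are commonly matched, assuming the semantics described in RFC 9309 (`*` matches any run of characters, a trailing `$` anchors the end, everything else is a prefix match). The function name `rule_matches` and the sample URLs are hypothetical. Python's standard `urllib.robotparser` does not expand wildcards, hence the hand-written matcher:

```python
import re

def rule_matches(pattern: str, path: str) -> bool:
    """Return True if a robots.txt path pattern matches the given URL path."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal parts and let each '*' match any run of characters.
    regex = ".*".join(re.escape(part) for part in pattern.split("*"))
    if anchored:
        regex += "$"
    # re.match anchors at the start, which gives prefix semantics.
    return re.match(regex, path) is not None

# Hypothetical paths, checked against patterns from the '*' group.
print(rule_matches("/*?trafficsource", "/lokales/artikel.html?trafficsource=rss"))  # True
print(rule_matches("/suche/", "/suche/?q=kassel"))  # True
print(rule_matches("/suche/", "/lokales/suche/"))   # False
```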

xovi

Rule Path
Disallow /

sistrix

Rule Path
Disallow /

searchmetricsbot

Rule Path
Disallow /

bingbot

Rule Path
Disallow /test/
Disallow /hna-sieben/

gptbot

Rule Path
Allow /ueber-uns/
Disallow /

ccbot

Rule Path
Allow /ueber-uns/
Disallow /
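
The gptbot and ccbot groups pair `Allow /ueber-uns/` with `Disallow /`, so the outcome depends on which rule takes precedence. A minimal sketch, assuming the longest-match rule from RFC 9309 (the most specific matching path wins, Allow wins ties, and no match means allowed). The helper `is_allowed` and the sample paths are hypothetical, and only plain prefix rules are handled:

```python
def is_allowed(rules, path):
    """rules: list of (directive, prefix) pairs with directive 'allow' or 'disallow'."""
    best = None
    for directive, prefix in rules:
        if path.startswith(prefix):
            # Longer prefixes rank higher; on equal length, allow beats disallow.
            key = (len(prefix), directive == "allow")
            if best is None or key > best:
                best = key
    return True if best is None else best[1]

# Rules as listed for the gptbot and ccbot groups.
rules = [("allow", "/ueber-uns/"), ("disallow", "/")]

print(is_allowed(rules, "/ueber-uns/impressum"))  # True
print(is_allowed(rules, "/lokales/"))             # False
```

Under this reading, these two crawlers may fetch only pages below `/ueber-uns/`; every other path falls back to `Disallow /`.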

msnbot

Rule Path
Disallow /test/
Disallow /hna-sieben/

Other Records

Field Value
crawl-delay 5

Other Records

Field Value
sitemap https://www.hna.de/news.xml

Comments

  • robots.txt www.hna.de
  • Legal notice: www.hna.de expressly reserves the right to use its content for commercial text and data mining (§ 44b UrhG).
  • The use of robots or other automated means to access www.hna.de or collect or mine data without the express permission of www.hna.de is strictly prohibited.