druid.io
robots.txt

Robots Exclusion Standard data for druid.io

Resource Scan

Scan Details

Site Domain druid.io
Base Domain druid.io
Scan Status Ok
Last Scan2025-12-20T18:59:35+00:00
Next Scan 2026-01-19T18:59:35+00:00

Last Scan

Scanned2025-12-20T18:59:35+00:00
URL https://druid.io/robots.txt
Redirect https://druid.apache.org/robots.txt
Redirect Domain druid.apache.org
Redirect Base apache.org
Domain IPs 3.222.246.181, 34.225.226.218
Redirect IPs 151.101.2.132, 2a04:4e42::644
Response IP 151.101.2.132
Found Yes
Hash 79ca6763f5b577133a31cd3ce12464eabc1a2d7dbdb3840977c31efab81a7b9d
SimHash 76b41f43c37b

Groups

*

Rule Path
Disallow /docs/0*/
Disallow /docs/1*/
Disallow /docs/2*/
Disallow /docs/3*/
Disallow /docs/4*/
Disallow /docs/5*/
Disallow /docs/6*/
Disallow /docs/7*/
Disallow /docs/8*/
Disallow /docs/9*/
Disallow /blog*

Comments

  • robots.txt for https://druid.apache.org
  • Keep robots from crawling old Druid doc versions
  • and the unused blog endpoint