ignite.apache.org
robots.txt

Robots Exclusion Standard data for ignite.apache.org

Resource Scan

Scan Details

Site Domain ignite.apache.org
Base Domain apache.org
Scan Status Ok
Last Scan2024-04-25T02:34:53+00:00
Next Scan 2024-05-25T02:34:53+00:00

Last Scan

Scanned2024-04-25T02:34:53+00:00
URL https://ignite.apache.org/robots.txt
Domain IPs 151.101.2.132, 2a04:4e42::644
Response IP 151.101.2.132
Found Yes
Hash 900be765ca3a794a1fcb3354cdc4fb34c39df2f8b7c9fc9839e5315dab883955
SimHash 4906f43bff71

Groups

*

Rule Path
Allow /docs/latest/
Disallow /docs/
Allow /releases/latest/
Disallow /releases/
Disallow /search?q=
Disallow */*_sc_token%3D
Disallow */*_x_tr_sl%3D
Disallow */*_x_tr_tl%3D
Disallow */*_x_tr_hl%3D
Disallow */*_x_tr_pto%3D
Disallow */*force_isolation%3D
Disallow */*Preferred%3D
Disallow */*utm_
Disallow */*placement%3D
Disallow */*yhid%3D
Disallow */*clid%3D
Disallow */?fbclid=
Disallow */?tpclid=
Disallow /*?_ym_debug
Disallow /*?calltouch_tm
Disallow */?source=
Disallow */?web_view=
Disallow */?event=
Disallow */?PageSpeed=
Disallow */?ref=
Disallow /?from=
Disallow /?_ga=

Other Records

Field Value
sitemap https://ignite.apache.org/sitemap.xml

Warnings

  • `host` is not a known field.