The module lifecycle stage: General Availability
The module has requirements for installation

v1.0.0

Changes

  • Changed the Observability module stage from Preview to General Availability.

Enhancements

  • Improved Russian CRD documentation and removed obsolete CRDs
  • Added TCP/TLS support to the Alertmanager gossip sync protocol.

Features

  • Added lightweight logs to the Observability module. You can select the lightweight-logs datasource in Explore Data to explore the logs of all pods in a namespace in one place. Filtering logs by labels and annotations is supported.
  • Added managed services monitoring support: monitoring dashboards for managed PostgreSQL or Memcache services are added to the Explore Data section of a project.
  • Added additional labels support for DKP alerts. This way project alerts may be
  • Added eXpress messenger support in Alertmanager. It can now be used when setting up a notification channel.
  • Added auth configuration to the webhook notification channel.
  • Added the ability to enable or disable Polk annotations in alert notifications.

Bug fixes

  • Fixed write path for external metrics access in module documentation
  • Fixed an issue that sometimes allowed upmeter probe test alerts to appear in the general alerts list.
  • Dropped finalizers from Deckhouse-deployed resources when the module is disabled.
  • Removed the default value for the interval field in the metrics rule groups specs.
  • Enabled watch config for the opagent daemonset to properly reload the SA token.
  • Alertmanager Telegram and Slack notifiers now split messages for oversized alert groups.
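
The message splitting mentioned in the last item can be illustrated with a minimal sketch. The release notes do not describe the module's actual algorithm, so everything here is an assumption: the function name split_alert_lines is hypothetical, and the 4096-character limit is the message text cap of the Telegram Bot API, used only as an example value.

```python
from typing import Iterable, List

# Assumed limit: the Telegram Bot API caps message text at 4096 characters.
MAX_MESSAGE_LEN = 4096


def split_alert_lines(lines: Iterable[str], limit: int = MAX_MESSAGE_LEN) -> List[str]:
    """Pack alert lines into messages of at most `limit` characters each.

    Hypothetical helper: lines are joined with newlines, and a new message
    is started whenever adding the next line would exceed the limit.
    """
    messages: List[str] = []
    current = ""
    for line in lines:
        # Hard-split a single line that cannot fit into one message by itself.
        pieces = [line[i:i + limit] for i in range(0, len(line), limit)] or [""]
        for piece in pieces:
            candidate = piece if not current else current + "\n" + piece
            if len(candidate) <= limit:
                current = candidate
            else:
                # The current message is full: flush it and start a new one.
                messages.append(current)
                current = piece
    if current:
        messages.append(current)
    return messages
```

Each returned string would be sent as a separate notification, so an oversized alert group arrives as several messages rather than being rejected by the messenger.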

v0.9.8

Bug fixes

  • Fixed templates to check whether required modules are enabled instead of checking for their CRDs.

v0.9.7

Bug fixes

  • Fixed access to the Prometheus remote write endpoint.

v0.9.6

Bug fixes

  • Fixed vulnerabilities related to CVE-2026-42151, CVE-2026-42154, CVE-2026-44903, CVE-2025-15558, CVE-2025-47914, CVE-2025-58181, CVE-2026-32952, stdlib (Go 1.25.10), and CVEs in bundled base images. Additional CVEs in github.com/prometheus/prometheus, github.com/grafana/tempo, github.com/getkin/kin-openapi, github.com/grafana/grafana-plugin-sdk-go and github.com/grafana/grafana are addressed via VEX statements as not exploitable.

v0.9.5

Bug fixes

  • Removed the default value for the interval field in the metrics rule groups specs.

v0.9.4

Bug fixes

  • Enabled watch config for opagent daemonset to properly reload SA token. That issue caused D8KubernetesStaleTokensDetected alerts for opagent.

v0.9.3

Bug fixes

  • Fixed Prometheus discovery with global HA mode disabled.

v0.9.2

Bug fixes

  • Fixed console dashboard panel links.

v0.9.1

Bug fixes

  • Fixed vulnerabilities related to CVEs

v0.9.0

Features

  • Added logs multitenancy support. Explore Data now shows only the logs of the current project or cluster. This makes it possible to restrict user access to the logs of specific namespaces.

Changes

  • Improved high availability mode to preserve firing and resolved alerts during module service rollouts and updates.
  • Replaced dashboard unique UID validation with an alert that notifies about UID conflicts.
  • Improved module services authorization.
  • Updated the Observability module documentation.

Bug fixes

  • Fixed naming convention validation for label names, annotation keys, and metrics names in observability rules to reject incompatible resources at apply time
  • Fixed an issue that caused project monitoring status alerts and rules to display incorrectly.
  • Fixed issues causing 401 responses that affected dashboards with many panels and remote write metrics requests.
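
The naming convention validation from the first item of this list can be sketched as follows. The patterns are the classic Prometheus data model rules for metric and label names. Whether the module enforces exactly these rules, and whether annotation keys follow the label name pattern, is an assumption, and all function names are hypothetical.

```python
import re
from typing import Dict, List

# Classic Prometheus data model patterns (assumed; the module may be stricter).
LABEL_NAME_RE = re.compile(r"[a-zA-Z_][a-zA-Z0-9_]*")
METRIC_NAME_RE = re.compile(r"[a-zA-Z_:][a-zA-Z0-9_:]*")


def is_valid_label_name(name: str) -> bool:
    """Return True if the whole string matches the label name pattern."""
    return LABEL_NAME_RE.fullmatch(name) is not None


def is_valid_metric_name(name: str) -> bool:
    """Return True if the whole string matches the metric name pattern."""
    return METRIC_NAME_RE.fullmatch(name) is not None


def find_invalid_names(metric: str, labels: Dict[str, str],
                       annotations: Dict[str, str]) -> List[str]:
    """Collect every name in a rule that would be rejected.

    Assumption: annotation keys are checked with the label name pattern.
    """
    invalid: List[str] = []
    if not is_valid_metric_name(metric):
        invalid.append(metric)
    invalid.extend(k for k in labels if not is_valid_label_name(k))
    invalid.extend(k for k in annotations if not is_valid_label_name(k))
    return invalid
```

Running such a check at apply time lets a resource with a name like team-name be rejected immediately instead of producing a rule that fails to load later.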

v0.8.2

Bug Fixes

  • Fixed a bug in the Email notification channel where the “Require TLS” setting was ignored.

v0.8.1

Bug fixes

  • Fixed vulnerabilities related to CVEs (CVE-2026-33186, CVE-2026-24051, CVE-2025-15558, CVE-2026-25679, CVE-2025-68470)

v0.8.0

Features

  • Alerting high availability — when Deckhouse Kubernetes Platform runs in high availability mode it also runs alerting in HA mode for better availability.
  • Improved Prometheus availability monitoring — DeadMansSwitch and PrometheusUnavailable alerts were updated for more reliable monitoring of Prometheus availability.
  • Propagated notification policies — You can configure alert notification delivery per project using propagated notification policies together with propagated notification channels.

Bug Fixes

  • Fixed empty rule group status in metrics rule groups caused by resource version conflicts.
  • Fixed module enablement failing when the global publicDomainTemplate setting was left blank.
  • Fixed PromQL queries using negative regexp matcher (!~) incorrectly returning “No data”.
  • Fixed incorrect graph rendering in Grafana when multiple conditions were applied to the same label in a PromQL query.
  • Fixed errors when using a rule group custom resource with no rules defined.
  • Cluster rule expressions now support an empty namespace for metrics selection.
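
For context on the negative regexp matcher fix above: in PromQL, regex matchers are fully anchored, and a label that is absent from a series is treated as an empty string. The sketch below models that selection rule only. It is not the module's implementation, the function name select_not_matching is hypothetical, and Python's re syntax differs in details from the RE2 syntax used by Prometheus.

```python
import re
from typing import Dict, List


def select_not_matching(series: List[Dict[str, str]], label: str,
                        pattern: str) -> List[Dict[str, str]]:
    """Model of a PromQL `label!~"pattern"` matcher over a list of label sets.

    A series is kept when the label value does NOT fully match the pattern.
    A missing label is treated as the empty string.
    """
    regex = re.compile(pattern)
    return [s for s in series if regex.fullmatch(s.get(label, "")) is None]


if __name__ == "__main__":
    series = [
        {"__name__": "up", "namespace": "kube-system"},
        {"__name__": "up", "namespace": "d8-monitoring"},
        {"__name__": "up", "namespace": "default"},
        {"__name__": "up"},  # no namespace label at all
    ]
    for s in select_not_matching(series, "namespace", "kube-.*|d8-.*"):
        print(s)
```

Under this rule a query such as up{namespace!~"kube-.*|d8-.*"} should still return the series in other namespaces, which is the behavior the fix restores.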

Enhancements

  • Deckhouse modules’ Prometheus rules are now stored in the d8-observability namespace.
  • Alerts from Prometheus rules that do not specify cluster/namespace scope are now listed under the corresponding namespace’s alerts instead of system alerts. This affects only rules not defined via ClusterObservabilityMetricsRulesGroup or ObservabilityMetricsRulesGroup.
  • FAQ section on configuring external Grafana access to metrics was clarified.

Release v0.7.23

Bug fixes

  • Fixed an issue that caused some Grafana datasource plugins (ClickHouse) to fail to connect to databases.
  • Fixed user role permissions.

Release v0.7.22

Bug fixes

  • Bumped the Go version to 1.25.6 to fix known vulnerabilities in stdlib.

Release v0.7.21

Changes

  • Turned on the Observability alerts API service for Kubernetes v1.32+.

Release v0.7.20

Changes

  • Expanded the Prometheus rules API

Release v0.7.19

Changes

  • Moved the Observability webhook to the control plane nodes. The component is now treated as critical for cluster stability.

Release v0.7.18

Changes

  • Removed internal module labels from Prometheus recording rules.

Release v0.7.17

Changes

  • Temporarily removed namespace enforcement for cluster rules.

Release v0.7.16

Bug fixes

  • Fixed an issue that caused namespace deletion to get stuck.

Release v0.7.15

Changes

  • Turned off the APIService v1alpha1.alerts.observability.deckhouse.io for Kubernetes v1.32.

Release v0.7.14

Bug fixes

  • Fixed an issue that caused the tenant Alertmanager configuration to fail to load.

Changes

  • The module now creates PrometheusRules in the d8-observability namespace. Creating PrometheusRule CRs in the d8-monitoring namespace triggered the D8CustomPrometheusRuleFoundInCluster alert.

v0.6.0

Features

  • Added dark theme support for the dashboard visualization components. The main UI already supported dark mode; this update extends full theme support to dashboards and graphs.
  • Introduced a new kiosk mode for dashboards. When enabled, all navigation elements and menus are hidden, allowing dashboards to be displayed in full-screen mode — ideal for use on wall-mounted displays or TV screens.

Security

  • Fixed vulnerabilities related to CVEs (CVE-2025-30153, CVE-2025-47912, CVE-2025-58183, CVE-2025-58185, CVE-2025-58186, CVE-2025-58187, CVE-2025-58188, CVE-2025-58189, CVE-2025-61723, CVE-2025-61724, CVE-2025-61725)

v0.5.6

Changes

  • Changed the module status from experimental to preview.

Bug Fixes

v0.5.5

Security

  • Fixed vulnerabilities related to CVEs

v0.5.4

Changes

  • Prepared for container v2 — added mount points to images as part of the transition groundwork, no impact on current usage.

v0.5.3

Bug Fixes

  • Fixed incorrect link mutations in dashboards — previously, some dashboard links could be displayed incorrectly due to a mutation issue.

v0.5.2

Bug Fixes

v0.5.1

Bug Fixes

  • Fixed an issue preventing correct exit from dashboard edit mode.

Features

  • Added ability to remote write metrics to cluster Prometheus.