The module lifecycle stage: General Availability
The module has requirements for installation
v1.0.0
Changes
- Changed the Observability module stage from Preview to General Availability
Enhancements
- Improved Russian CRD documentation and removed obsolete CRDs
- Added TCP/TLS support to the Alertmanager gossip sync protocol.
Features
- Added lightweight logs to the Observability module. You can select the lightweight-logs datasource in Explore Data to explore the logs of all pods in a namespace in one place. Filtering logs by labels and annotations is supported.
- Added managed services monitoring support: monitoring dashboards for managed PostgreSQL or Memcache services are added to the Explore Data section of a project.
- Added additional labels support for DKP alerts. This way project alerts may be
- Added eXpress messenger support in Alertmanager. It can now be used when setting up a notification channel.
- Added auth config to the notifications webhook channel
- Added the ability to enable or disable Polk annotations in alert notifications
Bug fixes
- Fixed write path for external metrics access in module documentation
- Fixed an issue that sometimes allowed upmeter probe test alerts to reach the general alerts list.
- Dropped finalizers from the Deckhouse-deployed resources on module disable.
- Removed the default value for the interval field in the metrics rule groups specs.
- Enabled watch config for the opagent daemonset to properly reload the SA token.
- Alertmanager Telegram and Slack notifiers now split messages for oversized alert groups.
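To illustrate what splitting a notification for an oversized alert group involves, here is a minimal sketch. It is not the module's implementation: the function name `split_message` and the choice to split on line boundaries are assumptions made for illustration. The default limit of 4096 characters is Telegram's maximum length for a text message.

```python
def split_message(text: str, limit: int = 4096) -> list[str]:
    """Split text into chunks of at most `limit` characters, preferring line boundaries."""
    if limit <= 0:
        raise ValueError("limit must be positive")
    chunks: list[str] = []
    current = ""
    for line in text.splitlines(keepends=True):
        # A single line longer than the limit is cut into hard slices.
        while len(line) > limit:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(line[:limit])
            line = line[limit:]
        # Start a new chunk when adding this line would exceed the limit.
        if len(current) + len(line) > limit:
            chunks.append(current)
            current = ""
        current += line
    if current:
        chunks.append(current)
    return chunks
```

Joining the returned chunks reproduces the original text, so no alert lines are lost across messages.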
v0.9.8
Bug fixes
- Fixed templates to check whether required modules are enabled instead of checking for their CRDs
v0.9.7
Bug fixes
- Fixed access to the Prometheus remote write endpoint
v0.9.6
Bug fixes
- Fixed vulnerabilities related to CVE-2026-42151, CVE-2026-42154, CVE-2026-44903, CVE-2025-15558, CVE-2025-47914, CVE-2025-58181, CVE-2026-32952, stdlib (Go 1.25.10), and CVEs in bundled base images. Additional CVEs in github.com/prometheus/prometheus, github.com/grafana/tempo, github.com/getkin/kin-openapi, github.com/grafana/grafana-plugin-sdk-go, and github.com/grafana/grafana are addressed via VEX statements as not exploitable.
v0.9.5
Bug fixes
- Removed the default value for the interval field in the metrics rule groups specs.
v0.9.4
Bug fixes
- Enabled watch config for the opagent daemonset to properly reload the SA token. Without it, opagent triggered D8KubernetesStaleTokensDetected alerts.
v0.9.3
Bug fixes
- Fixed Prometheus discovery with global HA mode disabled.
v0.9.2
Bug fixes
- Fixed console dashboard panel links.
v0.9.1
Bug fixes
- Fixed vulnerabilities related to CVEs
v0.9.0
Features
- Added logs multitenancy support. Explore Data now shows only the logs of the current project or cluster. This makes it possible to restrict user access to the logs of specific namespaces.
Changes
- Improved high availability mode to keep firing and resolved alerts during module service rollouts and updates
- Replaced dashboard unique UID validation with an alert that notifies about UID conflicts
- Improved module services authorization.
- Updated the Observability module documentation
Bug fixes
- Fixed naming convention validation for label names, annotation keys, and metrics names in observability rules to reject incompatible resources at apply time
- Fixed an issue that caused project monitoring status alerts and rules to display incorrectly
- Fixed issues causing 401 responses that affected dashboards with many panels and remote write metrics requests.
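The naming convention validation mentioned in the bug fixes above can be pictured with a short sketch. It assumes the classic Prometheus data model patterns for label and metric names; the module's actual rules are not stated in these notes and may differ. The function names are hypothetical, and annotation keys are left out of the sketch.

```python
import re

# Classic Prometheus data model patterns (an assumption here;
# the module's own validation rules may differ).
LABEL_NAME_RE = re.compile(r"[a-zA-Z_][a-zA-Z0-9_]*")
METRIC_NAME_RE = re.compile(r"[a-zA-Z_:][a-zA-Z0-9_:]*")


def is_valid_label_name(name: str) -> bool:
    """Return True if the whole string is a valid classic label name."""
    return LABEL_NAME_RE.fullmatch(name) is not None


def is_valid_metric_name(name: str) -> bool:
    """Return True if the whole string is a valid classic metric name."""
    return METRIC_NAME_RE.fullmatch(name) is not None
```

Under these patterns a label such as `team-name` is rejected because of the hyphen, while a recording rule name such as `job:http_requests:rate5m` is accepted as a metric name.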
v0.8.2
Bug Fixes
- Fixed a bug in the Email notification channel where the “Require TLS” setting was ignored.
v0.8.1
Bug fixes
- Fixed vulnerabilities related to CVEs (CVE-2026-33186, CVE-2026-24051, CVE-2025-15558, CVE-2026-25679, CVE-2025-68470)
v0.8.0
Features
- Alerting high availability — when Deckhouse Kubernetes Platform runs in high availability mode it also runs alerting in HA mode for better availability.
- Improved Prometheus availability monitoring — DeadMansSwitch and PrometheusUnavailable alerts were updated for more reliable monitoring of Prometheus availability.
- Propagated notification policies — You can configure alert notification delivery per project using propagated notification policies together with propagated notification channels.
Bug Fixes
- Fixed empty rule group status in metrics rule groups caused by resource version conflicts.
- Fixed module enablement failing when the global publicDomainTemplate setting was left blank.
- Fixed PromQL queries using negative regexp matcher (!~) incorrectly returning “No data”.
- Fixed incorrect graph rendering in Grafana when multiple conditions were applied to the same label in a PromQL query.
- Fixed errors when using a rule group custom resource with no rules defined.
- Cluster rule expressions now support an empty namespace for metrics selection.
Enhancements
- Deckhouse modules’ Prometheus rules are now stored in the d8-observability namespace.
- Alerts from Prometheus rules that do not specify cluster/namespace scope are now listed under the corresponding namespace’s alerts instead of system alerts. This affects only rules not defined via ClusterObservabilityMetricsRulesGroup or ObservabilityMetricsRulesGroup.
- FAQ section on configuring external Grafana access to metrics was clarified.
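For context on the negative regexp matcher (!~) fix in this release, the sketch below models how such a matcher selects series in PromQL: the pattern is fully anchored, and a missing label is treated as an empty string. The function name is hypothetical, and Python's `re` module stands in for the RE2 syntax that Prometheus uses, so the two can differ on advanced patterns.

```python
import re


def matches_not_regex(labels: dict[str, str], name: str, pattern: str) -> bool:
    """Return True if a series with these labels passes the selector name!~"pattern"."""
    # A label that is absent is treated as an empty string.
    value = labels.get(name, "")
    # Regex matchers are anchored, so the whole value must match the pattern.
    return re.fullmatch(pattern, value) is None
```

With the labels `{"namespace": "kube-system"}`, the selector `namespace!~"kube-.*"` excludes the series, while `namespace!~"kube"` keeps it because the anchored pattern does not match the full value.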
Release v0.7.23
Bug fixes
- Fixed an issue that caused some Grafana datasource plugins (ClickHouse) to fail to connect to databases.
- Fixed user role permissions
Release v0.7.22
Bug fixes
- Bumped the Go version to 1.25.6 to fix known vulnerabilities in stdlib
Release v0.7.21
Changes
- Turned on the Observability alerts API service for Kubernetes v1.32+
Release v0.7.20
Changes
- Expanded the Prometheus rules API
Release v0.7.19
Changes
- Observability webhook was moved to the control plane nodes. The component is now treated as critical for the cluster stability.
Release v0.7.18
Changes
- Removed internal module labels from Prometheus recording rules.
Release v0.7.17
Changes
- Temporarily removed namespace enforcement for cluster rules
Release v0.7.16
Bug fixes
- Fixed an issue that caused namespace deletion to get stuck.
Release v0.7.15
Changes
- Turned off the APIService v1alpha1.alerts.observability.deckhouse.io for Kubernetes v1.32
Release v0.7.14
Bug fixes
- Fixed an issue that caused the tenant Alertmanager configuration load to fail.
Changes
- The module now creates PrometheusRule resources in the d8-observability namespace. Creating them in the d8-monitoring namespace triggered the D8CustomPrometheusRuleFoundInCluster alert.
v0.6.0
Features
- Added dark theme support for the dashboard visualization components. While the main UI has previously supported dark mode, this update extends theme awareness to full support in dashboards and graphs.
- Introduced a new kiosk mode for dashboards. When enabled, all navigation elements and menus are hidden, allowing dashboards to be displayed in full-screen mode — ideal for use on wall-mounted displays or TV screens.
Security
- Fixed vulnerabilities related to CVEs (CVE-2025-30153, CVE-2025-47912, CVE-2025-58183, CVE-2025-58185, CVE-2025-58186, CVE-2025-58187, CVE-2025-58188, CVE-2025-58189, CVE-2025-61723, CVE-2025-61724, CVE-2025-61725)
v0.5.6
Changes
- Changed the module status from Experimental to Preview.
Bug Fixes
- Fixed an issue related to authorization token caching that caused incorrect triggering of the built-in alert: D8KubernetesStaleTokensDetected.
v0.5.5
Security
- Fixed vulnerabilities related to CVEs
v0.5.4
Changes
- Prepared for container v2 — added mount points to images as part of the transition groundwork, no impact on current usage.
v0.5.3
Bug Fixes
- Fixed incorrect link mutations in dashboards — previously, some dashboard links could be displayed incorrectly due to a mutation issue.
v0.5.2
Bug Fixes
- Fixed an issue related to authorization token caching that caused incorrect triggering of the built-in alert: D8KubernetesStaleTokensDetected.
v0.5.1
Bug Fixes
- Fixed an issue preventing correct exit from dashboard edit mode.
Features
- Added ability to remote write metrics to cluster Prometheus.