Cluster Live Monitoring

Cluster live monitoring in a CodeNOW Dataplane is built on Grafana dashboards, with Mimir as the metrics data source. Metrics from all workloads and cluster infrastructure are stored in Mimir and visualized through pre-built Grafana dashboards.

Grafana Dashboards with Mimir

Grafana provides the dashboarding layer for real-time cluster metrics. Mimir is a scalable, long-term storage backend for Prometheus-compatible metrics. Dashboards expose metrics collected by Prometheus from cluster workloads and infrastructure components.

Key capabilities available through the Grafana dashboards:

  • Monitor resource utilization (CPU, memory, network, storage) per namespace, workload, and pod
  • Track request rates, error rates, and latency for deployed services
  • View cluster-level infrastructure metrics, such as node health, the scheduler, and etcd
  • Use Mimir's PromQL-compatible query interface for ad-hoc metric exploration

For details on querying metrics, see the Mimir documentation.
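
As a rough illustration of ad-hoc exploration, the sketch below builds an instant PromQL query against Mimir's Prometheus-compatible HTTP API and parses the JSON result. Several things here are assumptions, not part of the CodeNOW documentation: the base URL `https://mimir.example.internal` is a placeholder, the `/prometheus` path prefix is Mimir's default and may be configured differently, the `X-Scope-OrgID` tenant header is only needed when multi-tenancy is enabled, and the metric `container_cpu_usage_seconds_total` is the usual cAdvisor name, which may or may not be scraped on a given Dataplane.

```python
import json
import urllib.parse
import urllib.request
from typing import Dict, Optional

# Placeholder address (assumption); the real one depends on the Dataplane.
MIMIR_BASE_URL = "https://mimir.example.internal"


def cpu_by_namespace_query(window: str = "5m") -> str:
    """PromQL for CPU cores used per namespace, averaged over `window`.

    Assumes the cAdvisor metric container_cpu_usage_seconds_total is collected.
    """
    return f"sum by (namespace) (rate(container_cpu_usage_seconds_total[{window}]))"


def build_instant_query(
    base_url: str, promql: str, tenant: Optional[str] = None
) -> urllib.request.Request:
    """Build a GET request for the instant-query endpoint.

    Assumes Mimir's default Prometheus API prefix, /prometheus.
    """
    params = urllib.parse.urlencode({"query": promql})
    url = f"{base_url.rstrip('/')}/prometheus/api/v1/query?{params}"
    headers = {"Accept": "application/json"}
    if tenant is not None:
        # Tenant header used by Mimir when multi-tenancy is enabled.
        headers["X-Scope-OrgID"] = tenant
    return urllib.request.Request(url, headers=headers)


def parse_instant_vector(payload: dict, label: str = "namespace") -> Dict[str, float]:
    """Map each series' `label` value to its sample value."""
    if payload.get("status") != "success":
        raise ValueError(f"query failed: {payload.get('error', 'unknown error')}")
    values: Dict[str, float] = {}
    for series in payload["data"]["result"]:
        _timestamp, raw_value = series["value"]  # value is [unix_ts, "string"]
        values[series["metric"].get(label, "")] = float(raw_value)
    return values


def run_query(request: urllib.request.Request, timeout: float = 10.0) -> dict:
    """Send the request and decode the JSON body (needs network access)."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

A call such as `parse_instant_vector(run_query(build_instant_query(MIMIR_BASE_URL, cpu_by_namespace_query())))` would return one CPU figure per namespace, provided the endpoint is reachable and any authentication in front of it is handled separately.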

Access

Cluster Live Monitoring dashboards are accessed per Dataplane: the Self-Service Portal links directly to the Grafana instance on the relevant Dataplane. Each Dataplane maintains its own Mimir instance, which keeps metrics data strictly isolated between clusters.
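
The per-Dataplane layout can be pictured as a lookup from a Dataplane name to its own Grafana address, with no shared fallback. The sketch below is illustrative only: the Dataplane names, the URLs, and the `DataplaneMonitoring` type are hypothetical, and in practice the Self-Service Portal provides the link. The only real endpoint used is Grafana's `/api/health` health check.

```python
from dataclasses import dataclass
from typing import Mapping


@dataclass(frozen=True)
class DataplaneMonitoring:
    """Monitoring entry point for one Dataplane (hypothetical type)."""

    name: str
    grafana_url: str


# Hypothetical registry; real names and URLs come from the Self-Service Portal.
DATAPLANES: Mapping[str, DataplaneMonitoring] = {
    "dev": DataplaneMonitoring("dev", "https://grafana.dev.example.internal"),
    "prod": DataplaneMonitoring("prod", "https://grafana.prod.example.internal/"),
}


def grafana_health_url(
    dataplane: str, registry: Mapping[str, DataplaneMonitoring] = DATAPLANES
) -> str:
    """Return the Grafana health-check URL for exactly one Dataplane.

    Unknown names raise instead of falling back to another Dataplane,
    mirroring the isolation between clusters.
    """
    try:
        entry = registry[dataplane]
    except KeyError:
        raise LookupError(f"unknown Dataplane: {dataplane!r}") from None
    return f"{entry.grafana_url.rstrip('/')}/api/health"
```

Because every lookup resolves to a single Dataplane, a dashboard or query opened this way only ever sees the metrics stored on that Dataplane.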