📚Book Signing at KubeCon EU 2026Meet us at Booking.com HQ (Mon 18:30-21:00) & vCluster booth #521 (Tue 24 Mar, 12:30-1:30pm) — free book giveaway!RSVP Booking.com Event

📊 Observability Recipes

Monitor and observe your clusters with Prometheus, Grafana, logging pipelines, distributed tracing, and alerting configurations.

19 recipes available

Intermediate

Per-Tenant GPU Monitoring and Chargeback

Build per-tenant GPU monitoring dashboards with queue time, utilization, thermal metrics, and GPU-hour chargeback on Kubernetes.

⏱ 15 minutes K8s 1.28+

GPU Tenant SLO Observability

Define and monitor GPU tenant SLOs for queue time, inference latency, GPU utilization, and job completion rate with Prometheus alerting.

⏱ 15 minutes K8s 1.28+

OpenClaw Logging with EFK Stack

Collect and analyze OpenClaw agent logs using Elasticsearch, Fluent Bit, and Kibana for debugging and audit trails.

⏱ 15 minutes K8s 1.28+

Monitor OpenClaw with Prometheus and Grafana on Kubernetes

Set up monitoring for OpenClaw AI gateway on Kubernetes with Prometheus metrics, Grafana dashboards, and alerting for uptime, message throughput, and.

⏱ 20 minutes K8s 1.28+

Monitor NCCL Benchmark Runs with Prometheus and Grafana

Track NCCL benchmark outcomes and GPU telemetry over time with Prometheus and Grafana dashboards to detect communication regressions early.

⏱ 30 minutes K8s 1.28+

How to Set Up Node Problem Detector

Detect and report node-level issues automatically with Node Problem Detector. Learn to identify kernel problems, hardware failures, and container.

⏱ 20 minutes K8s 1.28+

How to Set Up Alertmanager for Prometheus

Configure Alertmanager to route and manage Prometheus alerts. Set up notification channels including Slack, PagerDuty, and email with routing rules.

⏱ 15 minutes K8s 1.28+

How to Implement Container Logging Patterns

Configure logging for Kubernetes applications. Implement sidecar logging, log aggregation, and structured logging best practices.

⏱ 15 minutes K8s 1.28+

How to Monitor Kubernetes with Grafana Dashboards

Create comprehensive Grafana dashboards for Kubernetes monitoring. Learn to visualize cluster, node, pod, and application metrics effectively.

⏱ 15 minutes K8s 1.28+

Jaeger Distributed Tracing on Kubernetes

Deploy Jaeger for distributed tracing in Kubernetes. Trace requests across microservices to identify latency issues and debug complex systems.

⏱ 15 minutes K8s 1.28+

How to Set Up Prometheus Monitoring

Deploy Prometheus for Kubernetes monitoring. Collect metrics from nodes, pods, and applications with ServiceMonitors and alerting rules.

⏱ 15 minutes K8s 1.28+

How to Monitor Kubernetes with Prometheus

Set up Prometheus monitoring for Kubernetes clusters. Configure scraping, alerting rules, and visualize metrics with Grafana dashboards.

⏱ 15 minutes K8s 1.28+

How to Configure Alertmanager for Kubernetes Alerts

Set up Alertmanager to route, group, and deliver Kubernetes alerts. Learn to configure Slack, PagerDuty, and email notifications.

⏱ 30 minutes K8s 1.28+

How to Set Up Prometheus Monitoring for Applications

Learn to instrument your Kubernetes applications with Prometheus metrics. Complete guide to ServiceMonitors, scraping configuration, and custom metrics.

⏱ 35 minutes K8s 1.28+

Want more observability patterns?

Our book includes an entire chapter dedicated to observability with dozens more examples.

📖 Explore All Chapters
Luca Berton Ansible Pilot Ansible by Example Open Empower K8s Recipes Terraform Pilot CopyPasteLearn ProteinLens