📊 Observability Recipes

Monitor and observe your clusters with Prometheus, Grafana, logging pipelines, distributed tracing, and alerting configurations.

19 recipes available

Beginner

How to Set Up Container Logging

Implement effective logging strategies for Kubernetes containers. Configure log collection, aggregation, and analysis with various logging patterns.

⏱ 15 minutes K8s 1.28+

How to Use Kubernetes Events for Monitoring

Monitor cluster activity through Kubernetes events. Capture, filter, and alert on events for troubleshooting and operational visibility.

⏱ 15 minutes K8s 1.28+

Intermediate

Per-Tenant GPU Monitoring and Chargeback

Build per-tenant GPU monitoring dashboards with queue time, utilization, thermal metrics, and GPU-hour chargeback on Kubernetes.

⏱ 15 minutes K8s 1.28+

GPU Tenant SLO Observability

Define and monitor GPU tenant SLOs for queue time, inference latency, GPU utilization, and job completion rate with Prometheus alerting.

⏱ 15 minutes K8s 1.28+

OpenClaw Logging with EFK Stack

Collect and analyze OpenClaw agent logs using Elasticsearch, Fluent Bit, and Kibana for debugging and audit trails.

⏱ 15 minutes K8s 1.28+

Monitor OpenClaw with Prometheus and Grafana on Kubernetes

Set up monitoring for OpenClaw AI gateway on Kubernetes with Prometheus metrics, Grafana dashboards, and alerting for uptime, message throughput, and.

⏱ 20 minutes K8s 1.28+

Monitor NCCL Benchmark Runs with Prometheus and Grafana

Track NCCL benchmark outcomes and GPU telemetry over time with Prometheus and Grafana dashboards to detect communication regressions early.

⏱ 30 minutes K8s 1.28+

How to Set Up Node Problem Detector

Detect and report node-level issues automatically with Node Problem Detector. Learn to identify kernel problems, hardware failures, and container.

⏱ 20 minutes K8s 1.28+

How to Set Up Alertmanager for Prometheus

Configure Alertmanager to route and manage Prometheus alerts. Set up notification channels including Slack, PagerDuty, and email with routing rules.

⏱ 15 minutes K8s 1.28+

How to Implement Container Logging Patterns

Configure logging for Kubernetes applications. Implement sidecar logging, log aggregation, and structured logging best practices.

⏱ 15 minutes K8s 1.28+

How to Monitor Kubernetes with Grafana Dashboards

Create comprehensive Grafana dashboards for Kubernetes monitoring. Learn to visualize cluster, node, pod, and application metrics effectively.

⏱ 15 minutes K8s 1.28+

Jaeger Distributed Tracing on Kubernetes

Deploy Jaeger for distributed tracing in Kubernetes. Trace requests across microservices to identify latency issues and debug complex systems.

⏱ 15 minutes K8s 1.28+

How to Set Up Prometheus Monitoring

Deploy Prometheus for Kubernetes monitoring. Collect metrics from nodes, pods, and applications with ServiceMonitors and alerting rules.

⏱ 15 minutes K8s 1.28+

How to Monitor Kubernetes with Prometheus

Set up Prometheus monitoring for Kubernetes clusters. Configure scraping, alerting rules, and visualize metrics with Grafana dashboards.

⏱ 15 minutes K8s 1.28+

How to Configure Alertmanager for Kubernetes Alerts

Set up Alertmanager to route, group, and deliver Kubernetes alerts. Learn to configure Slack, PagerDuty, and email notifications.

⏱ 30 minutes K8s 1.28+

How to Set Up Prometheus Monitoring for Applications

Learn to instrument your Kubernetes applications with Prometheus metrics. Complete guide to ServiceMonitors, scraping configuration, and custom metrics.

⏱ 35 minutes K8s 1.28+

Want more observability patterns?

Our book includes an entire chapter dedicated to observability with dozens more examples.

📖 Explore All Chapters

📊 Observability Recipes

Beginner

How to Set Up Container Logging

How to Use Kubernetes Events for Monitoring

Intermediate

Per-Tenant GPU Monitoring and Chargeback

GPU Tenant SLO Observability

OpenClaw Logging with EFK Stack

Monitor OpenClaw with Prometheus and Grafana on Kubernetes

Monitor NCCL Benchmark Runs with Prometheus and Grafana

How to Set Up Node Problem Detector

How to Set Up Alertmanager for Prometheus

How to Implement Container Logging Patterns

How to Monitor Kubernetes with Grafana Dashboards

Jaeger Distributed Tracing on Kubernetes

How to Set Up Prometheus Monitoring

How to Monitor Kubernetes with Prometheus

How to Configure Alertmanager for Kubernetes Alerts

How to Set Up Prometheus Monitoring for Applications

Advanced

How to Implement Distributed Tracing with Jaeger

How to Set Up Centralized Logging with EFK Stack

How to Collect Metrics with OpenTelemetry Collector

Want more observability patterns?