Monitoring

Stack

  • Prometheus: Metrics collection, deployed via the kube-prometheus-stack Helm chart
  • Grafana: Dashboards and alerting, accessible at grafana.akze.net
  • Loki: Log aggregation, queried through a Grafana datasource
  • Hubble: Cilium network observability, accessible at hubble.akze.net
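
As a quick reachability check for the Grafana entry above, the following is a minimal sketch that builds the URL of Grafana's `/api/health` endpoint and interprets its JSON response. The `https` scheme and the helper names (`grafana_health_url`, `database_ok`) are assumptions made for illustration; only the hostname comes from this page.

```python
import json
from urllib.parse import urlunsplit

# Hostname taken from the stack list; the https scheme is an assumption.
GRAFANA_HOST = "grafana.akze.net"


def grafana_health_url(host: str = GRAFANA_HOST) -> str:
    """Build the URL of Grafana's /api/health endpoint (hypothetical helper)."""
    return urlunsplit(("https", host, "/api/health", "", ""))


def database_ok(body: str) -> bool:
    """Return True if a /api/health JSON body reports database == "ok"."""
    return json.loads(body).get("database") == "ok"
```

Fetching the URL (for example with `urllib.request.urlopen`) and passing the response body to `database_ok` gives a simple up/down signal; the network call is left out here so the sketch stays side-effect free.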

Key Dashboards

  • Node resource usage (CPU, memory, disk)
  • Pod restart and crash tracking
  • Traefik request rates and error codes
  • CrowdSec ban activity
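
The dashboards listed above are typically backed by PromQL queries. The sketch below shows one plausible query per panel and builds a URL for Prometheus's instant-query endpoint (`/api/v1/query`). These are not the actual panel definitions: the metric names assume node-exporter and kube-state-metrics (both shipped with kube-prometheus-stack), Traefik's Prometheus metrics being enabled, and CrowdSec's Prometheus endpoint being scraped. The base URL and the names `PANEL_QUERIES` and `instant_query_url` are hypothetical.

```python
from urllib.parse import urlencode

# Illustrative PromQL per dashboard panel; assumptions, not the real panels.
PANEL_QUERIES = {
    # Fraction of CPU time not spent idle, per node (node-exporter).
    "node_cpu_busy_ratio":
        '1 - avg by (instance) (rate(node_cpu_seconds_total{mode="idle"}[5m]))',
    # Fraction of memory in use (node-exporter).
    "node_memory_used_ratio":
        "1 - node_memory_MemAvailable_bytes / node_memory_MemTotal_bytes",
    # Fraction of filesystem space in use (node-exporter).
    "node_disk_used_ratio":
        "1 - node_filesystem_avail_bytes / node_filesystem_size_bytes",
    # Container restarts over the last hour (kube-state-metrics).
    "pod_restarts_1h":
        "increase(kube_pod_container_status_restarts_total[1h])",
    # Request rate grouped by HTTP status code (Traefik metrics).
    "traefik_requests_by_code":
        "sum by (code) (rate(traefik_service_requests_total[5m]))",
    # Currently active decisions such as bans (CrowdSec metrics).
    "crowdsec_active_decisions":
        "sum(cs_active_decisions)",
}


def instant_query_url(base_url: str, panel: str) -> str:
    """Build a Prometheus instant-query URL for one of the panels above."""
    query = urlencode({"query": PANEL_QUERIES[panel]})
    return f"{base_url.rstrip('/')}/api/v1/query?{query}"
```

For example, with Prometheus port-forwarded to `http://localhost:9090` (an assumed address), `instant_query_url("http://localhost:9090", "pod_restarts_1h")` yields a URL that can be fetched with any HTTP client to inspect the raw series behind the restart panel.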