Monitoring
Stack
- Prometheus: Metrics collection via kube-prometheus-stack Helm chart
- Grafana: Dashboards and alerting — accessible at
grafana.akze.net - Loki: Log aggregation — queried through Grafana datasource
- Hubble: Cilium network observability — accessible at
hubble.akze.net
Key Dashboards
- Node resource usage (CPU, memory, disk)
- Pod restart and crash tracking
- Traefik request rates and error codes
- CrowdSec ban activity