fix(aula-08): prevenir volume stalling com CSI tolerations e PDB

- Adicionar hcloud-csi-values.yaml com tolerations para node failures
- Configurar 2 replicas do CSI controller para HA
- Criar statefulset-pdb.yaml para proteger StatefulSets durante drain
- Documentar troubleshooting de volumes stuck no README
This commit is contained in:
ArgoCD Setup
2026-01-23 18:45:00 -03:00
parent 9f96e97205
commit 2480c82944
4 changed files with 74 additions and 3 deletions

View File

@@ -0,0 +1,13 @@
# Configuracoes para graceful handling de node failures
controller:
tolerations:
- key: "node.kubernetes.io/unreachable"
operator: "Exists"
effect: "NoExecute"
tolerationSeconds: 60
- key: "node.kubernetes.io/not-ready"
operator: "Exists"
effect: "NoExecute"
tolerationSeconds: 60
# Mais replicas para HA do controller
replicaCount: 2