960a419792
Build & Test (NowChessSystems) TeamCity build failed
When pod deletion fails, instances remained in registry with state=DEAD, preventing scale-down since avgLoad calculation counted them. Now: - Use gracePeriod(0) for immediate pod deletion instead of 30s wait (prevents cascade when nodes are down or pods stuck terminating) - Remove instance from registry on deletion failure anyway (prevents dead instances from blocking scale-down via avgLoad) This breaks the cycle: failed deletions → scaleUp → max replicas → more failures → more stuck instances blocking recovery.