Files
NowChessSystems/modules/coordinator
Janis 960a419792
Build & Test (NowChessSystems) TeamCity build failed
fix: force-delete hanging pods and remove failed instances from registry
When pod deletion fails, instances remained in registry with state=DEAD,
preventing scale-down since avgLoad calculation counted them. Now:

- Use gracePeriod(0) for immediate pod deletion instead of 30s wait
  (prevents cascade when nodes are down or pods stuck terminating)
- Remove instance from registry on deletion failure anyway
  (prevents dead instances from blocking scale-down via avgLoad)

This breaks the cycle: failed deletions → scaleUp → max replicas →
more failures → more stuck instances blocking recovery.
2026-05-14 09:57:29 +02:00
..
2026-05-13 22:24:05 +00:00
2026-05-13 22:24:05 +00:00