Skip to content

fix: configure Talos cluster for proper HA with VIP#30

Merged
winlinuxmatt merged 1 commit into
mainfrom
fix/talos-ha-configuration
Dec 25, 2025
Merged

fix: configure Talos cluster for proper HA with VIP#30
winlinuxmatt merged 1 commit into
mainfrom
fix/talos-ha-configuration

Conversation

@winlinuxmatt
Copy link
Copy Markdown
Owner

Summary

This PR fixes the Talos cluster configuration for proper high availability.

Changes

  • Add cluster_vip variable (10.0.0.69) for HA API server endpoint
  • Update all control plane configs to use VIP as cluster_endpoint
  • Add VIP config_patches to control plane nodes for automatic failover
  • Update all worker configs to use VIP instead of individual CP IPs
  • Add kubernetes_version to worker configs for consistency
  • Fix kubernetes_version from 1.34.0 to 1.32.0 (stable)
  • Update talosconfig endpoints to include VIP and all CP nodes

Problem

Previously, each worker pointed to a different control plane IP:

  • Worker 1 → CP 1
  • Worker 2 → CP 2
  • Worker 3 → CP 3

This caused instability when individual control planes were unavailable, as workers couldn't failover to other CPs.

Solution

All nodes now use a shared VIP (10.0.0.69) that floats between control plane nodes, providing true HA for the Kubernetes API server.

Testing

  • All nodes healthy and running v1.32.0
  • VIP active and responding
  • talosctl health passes on all control planes
  • All pods running successfully

- Add cluster_vip variable (10.0.0.69) for HA API server endpoint
- Update all control plane configs to use VIP as cluster_endpoint
- Add VIP config_patches to control plane nodes for automatic failover
- Update all worker configs to use VIP instead of individual CP IPs
- Add kubernetes_version to worker configs for consistency
- Fix kubernetes_version from 1.34.0 to 1.32.0 (stable)
- Update talosconfig endpoints to include VIP and all CP nodes

This fixes issues where workers pointed to different control planes,
causing instability when individual CPs were unavailable.
@winlinuxmatt winlinuxmatt merged commit 013029e into main Dec 25, 2025
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant