-
Notifications
You must be signed in to change notification settings - Fork 1.5k
KCP does not remove the etcd member for a machine that failed to join the control plane #13221
Copy link
Copy link
Open
Labels
area/provider/control-plane-kubeadmIssues or PRs related to KCPIssues or PRs related to KCPhelp wantedDenotes an issue that needs help from a contributor. Must meet "help wanted" guidelines.Denotes an issue that needs help from a contributor. Must meet "help wanted" guidelines.kind/bugCategorizes issue or PR as related to a bug.Categorizes issue or PR as related to a bug.priority/important-soonMust be staffed and worked on either currently, or very soon, ideally in time for the next release.Must be staffed and worked on either currently, or very soon, ideally in time for the next release.triage/acceptedIndicates an issue or PR is ready to be actively worked on.Indicates an issue or PR is ready to be actively worked on.
Metadata
Metadata
Assignees
Labels
area/provider/control-plane-kubeadmIssues or PRs related to KCPIssues or PRs related to KCPhelp wantedDenotes an issue that needs help from a contributor. Must meet "help wanted" guidelines.Denotes an issue that needs help from a contributor. Must meet "help wanted" guidelines.kind/bugCategorizes issue or PR as related to a bug.Categorizes issue or PR as related to a bug.priority/important-soonMust be staffed and worked on either currently, or very soon, ideally in time for the next release.Must be staffed and worked on either currently, or very soon, ideally in time for the next release.triage/acceptedIndicates an issue or PR is ready to be actively worked on.Indicates an issue or PR is ready to be actively worked on.
What steps did you take and what happened?
kubeadm join --control-planeruns, it adds an etcd member as a learner, tries to promote it, and eventually fails, because etcd is not running, since kubelet is not running. The etcd member remains unstarted.kubeadm join --control-planeruns, it will try to add a member as a learner, and that will fail, because no etcd members can be added, while there is one unstarted etcd member.What did you expect to happen?
KCP should remove the unstarted etcd member that was added for the machine that failed to join the cluster.
Cluster API version
v1.10.7
Kubernetes version
v1.34.2
Anything else you would like to add?
This issue came out of a discussion in kubernetes/kubeadm#3269 (comment)
Label(s) to be applied
/kind bug
/area provider/control-plane-kubeadm