fix volumeAttachment leak when kube-controller restarts during the execution of DetachVolume #130516

goushicui · 2025-03-01T16:42:15Z

What type of PR is this?

/kind bug

What this PR does / why we need it:

Which issue(s) this PR fixes:

When the kube-controller-manager restarts during the execution of DetachVolume, orphaned volumeAttachment objects may persist in the API server, leading to resource leaks. This occurs due to inconsistencies between node status updates and volumeAttachment cleanup logic during controller recovery.

Workflow Leading to Leak:

1. DetachVolume Initiation

The volume is removed from node.status.volumesAttached before DetachVolume is executed.

2. Controller Restart

If kube-controller-manager restarts at this point, attach_detach_controller rebuilds the actualStateOfWorld cache by iterating over node.Status.VolumesAttached. Since the volume was already removed from the node status, it is not added to the cache.

3. Orphaned volumeAttachment Handling

During processVolumeAttachments, the controller checks whether actualStateOfWorld reports the volume as AttachStateDetached:

    attachState := adc.actualStateOfWorld.GetAttachState(volumeName, nodeName)
    if attachState == cache.AttachStateDetached {
        err = adc.actualStateOfWorld.MarkVolumeAsUncertain(logger, volumeName, volumeSpec, nodeName)
    }

Because the volume is absent from the cache (due to step 2), the orphaned volumeAttachment is not re-added to actualStateOfWorld, resulting in a persistent leak.
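The rebuild from node status described in the Controller Restart step can be sketched as follows. This is a simplified model rather than the controller's code: the `node` struct, the `key` type and `populateFromNodes` are hypothetical names, and the cache is reduced to a plain map.

```go
package main

import "fmt"

// node is a hypothetical stand-in for the fields of a Node object used here.
type node struct {
	name            string
	volumesAttached []string // models node.Status.VolumesAttached
}

// key identifies one volume/node pair in the modelled cache.
type key struct{ volume, node string }

// populateFromNodes models the rebuild step: only volumes that are still
// listed in a node's status end up in the cache.
func populateFromNodes(nodes []node) map[key]bool {
	asw := make(map[key]bool)
	for _, n := range nodes {
		for _, v := range n.volumesAttached {
			asw[key{volume: v, node: n.name}] = true
		}
	}
	return asw
}

func main() {
	// vol-b was removed from the node status just before the restart,
	// so only vol-a is still listed.
	nodes := []node{{name: "node-1", volumesAttached: []string{"vol-a"}}}
	asw := populateFromNodes(nodes)

	fmt.Println(asw[key{"vol-a", "node-1"}]) // true
	fmt.Println(asw[key{"vol-b", "node-1"}]) // false
}
```

The sketch covers only the node-status pass; whether a later pass over volumeAttachment objects picks up `vol-b` is the point under discussion in this PR.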

Fixes #

Special notes for your reviewer:

Does this PR introduce a user-facing change?

Additional documentation e.g., KEPs (Kubernetes Enhancement Proposals), usage docs, etc.:

k8s-ci-robot · 2025-03-01T16:42:17Z

Adding the "do-not-merge/release-note-label-needed" label because no release-note block was detected, please follow our release note process to remove it.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

k8s-ci-robot · 2025-03-01T16:42:23Z

This issue is currently awaiting triage.

If a SIG or subproject determines this is a relevant issue, they will accept it by applying the triage/accepted label and provide further guidance.

The triage/accepted label can be added by org members by writing /triage accepted in a comment.

k8s-ci-robot · 2025-03-01T16:42:24Z

Hi @goushicui. Thanks for your PR.

I'm waiting for a kubernetes member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work. Regular contributors should join the org to skip this step.

Once the patch is verified, the new status will be reflected by the ok-to-test label.

I understand the commands that are listed here.

goushicui · 2025-03-01T16:47:43Z

@gnufied

k8s-ci-robot · 2025-03-01T20:43:41Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: goushicui, vahan-sahakyan-op
Once this PR has been reviewed and has the lgtm label, please assign thockin for approval. For more information see the Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

pkg/controller/volume/OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

goushicui · 2025-03-02T05:37:22Z

/assign @gnufied

mauriciopoppe · 2025-03-03T15:30:58Z

/uncc

goushicui · 2025-03-07T02:02:54Z

/assgin @thockin

goushicui · 2025-03-07T02:03:59Z

/assign @thockin

gnufied · 2025-03-07T12:50:38Z

/ok-to-test

k8s-ci-robot · 2025-03-07T13:39:17Z

@goushicui: The following test failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

Test name	Commit	Details	Required	Rerun command
pull-kubernetes-unit	`9479aef`	link	true	`/test pull-kubernetes-unit`

Full PR test history. Your PR dashboard. Please help us cut down on flakes by linking to an open issue when you hit one in your PR.

gnufied · 2025-03-07T13:44:33Z

Shouldn't that volume be detached by external-attacher anyway, regardless of a KCM restart? Once deletionTimestamp is set on a VA object, it will get detached by external-attacher.

What exactly did we leak in this case? Are you saying the volume is not detaching in this case?

goushicui · 2025-03-08T04:04:08Z

@gnufied Yes, the kube-controller restarted before the volumeAttachment started deleting.

gnufied · 2025-03-08T12:34:03Z

> @gnufied Yes, the kube-controller restarted before the volumeAttachment started deleting.

But you didn't answer the rest of my question.

goushicui · 2025-03-08T14:44:35Z

> Shouldn't that volume be detached by external-attacher anyway, regardless of a KCM restart? Once deletionTimestamp is set on a VA object, it will get detached by external-attacher.

@gnufied Are you referring to this question? I suggest you take a closer look at the description above. If deletionTimestamp is set on the volumeAttachment, the external-attacher can indeed perform the detachment in a timely manner. However, what if the deletion call fails, or if KCM (Kubernetes Controller Manager) restarts before DetachVolume is executed due to serial processing of volumes?
If the volume is not re-added here, neither the desired cache nor the actual cache will contain any information about it. How would a subsequent reconcile then perform the deletion?

carlory · 2025-03-10T03:51:58Z

KCM rebuilds the ASW from both node and VA objects; please see https://github.com/carlory/kubernetes/blob/master/pkg/controller/volume/attachdetach/attach_detach_controller.go#L683

goushicui · 2025-03-10T05:39:26Z

https://github.com/carlory/kubernetes/blob/master/pkg/controller/volume/attachdetach/attach_detach_controller.go#L737

    attachState := adc.actualStateOfWorld.GetAttachState(volumeName, nodeName)
    if attachState == cache.AttachStateDetached {
        err = adc.actualStateOfWorld.MarkVolumeAsUncertain(logger, volumeName, volumeSpec, nodeName)
    }

Look at the condition here. The volume has already been removed from the node status before the detach. Do you think this condition can hold? @carlory

carlory · 2025-03-10T07:28:00Z

This check is correct. If the ASW has populated the volume from the node status, the expected state of the volume is attached. If not, but the volume is found in a VA object, its state is uncertain: we cannot say whether it is attached or detached. The reconciler will take care of it and mark it as attached or detached later. This is not a problem.

carlory · 2025-03-10T07:31:15Z

> it is not added to the cache.

That is not correct. MarkVolumeAsUncertain adds the volume to the ASW, but with its state marked as Uncertain.
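The behaviour described in this reply can be illustrated with a small model. It is a sketch under one assumption that should be checked against the real cache: a volume/node pair that is absent from the ASW is reported as detached by the state lookup. All names here (`aswModel`, `getAttachState`, `markUncertain`, `processVolumeAttachment`) are hypothetical and only mirror the shape of the quoted check.

```go
package main

import "fmt"

// attachState is a hypothetical stand-in for the cache's attach states.
type attachState string

const (
	stateAttached  attachState = "Attached"
	stateUncertain attachState = "Uncertain"
	stateDetached  attachState = "Detached"
)

// key identifies one volume/node pair.
type key struct{ volume, node string }

// aswModel maps a pair to whether its attachment is confirmed.
type aswModel map[key]bool

// getAttachState assumes that a pair missing from the cache is reported
// as detached (an assumption of this sketch).
func (a aswModel) getAttachState(volume, node string) attachState {
	confirmed, ok := a[key{volume, node}]
	switch {
	case !ok:
		return stateDetached
	case confirmed:
		return stateAttached
	default:
		return stateUncertain
	}
}

// markUncertain adds the pair to the cache with an unconfirmed attachment.
func (a aswModel) markUncertain(volume, node string) {
	a[key{volume, node}] = false
}

// processVolumeAttachment mirrors the quoted check: a pair the cache
// reports as detached, but for which a VA object exists, becomes uncertain.
func processVolumeAttachment(a aswModel, volume, node string) {
	if a.getAttachState(volume, node) == stateDetached {
		a.markUncertain(volume, node)
	}
}

func main() {
	// vol-a came from the node status; vol-b exists only as a VA object.
	asw := aswModel{key{"vol-a", "node-1"}: true}
	processVolumeAttachment(asw, "vol-a", "node-1")
	processVolumeAttachment(asw, "vol-b", "node-1")

	fmt.Println(asw.getAttachState("vol-a", "node-1")) // Attached
	fmt.Println(asw.getAttachState("vol-b", "node-1")) // Uncertain
}
```

Under that assumption, the check does not skip a volume that is missing from the cache; it is exactly the case in which the volume gets added.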

goushicui · 2025-03-10T07:40:36Z

    err = rc.actualStateOfWorld.RemoveVolumeFromReportAsAttached(attachedVolume.VolumeName, attachedVolume.NodeName)
    if err != nil {
        logger.V(5).Info("RemoveVolumeFromReportAsAttached failed while removing volume from node",
            "node", klog.KRef("", string(attachedVolume.NodeName)),
            "volumeName", attachedVolume.VolumeName,
            "err", err)
    }

    // Update Node Status to indicate volume is no longer safe to mount.
    err = rc.nodeStatusUpdater.UpdateNodeStatusForNode(logger, attachedVolume.NodeName)

Suppose KCM restarts after this point. How can the volume information in the ASW be obtained through reconciliation? If it cannot be retrieved here, how would it be set to Uncertain?

> If the ASW has populated the volume from the node status, the expected state of the volume is attached.

@carlory

carlory · 2025-03-10T07:49:01Z

https://github.com/carlory/kubernetes/blob/aab083972dbb5620b6daa62172aa1694e85facd7/pkg/controller/volume/attachdetach/attach_detach_controller.go#L347

If KCM is restarted, the ADC controller rebuilds its cache before it starts the reconciler. If the volume has been removed from the node's attached volumes but the VA object still exists, the ADC controller adds the volume to the ASW and marks it as uncertain. After the ASW and DSW are populated, the reconciler is started. It compares the ASW and DSW, and then redoes the detach operation.
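The comparison step at the end of this reply can be sketched as a set difference. This is a minimal model, not the reconciler's code: `volumesToDetach` and the map-based caches are hypothetical, and an uncertain entry is modelled as a key whose value is `false`.

```go
package main

import (
	"fmt"
	"sort"
)

// key identifies one volume/node pair.
type key struct{ volume, node string }

// volumesToDetach returns every pair that is present in the actual state
// (attached or uncertain) but not wanted by the desired state.
// The result is sorted so the output is deterministic.
func volumesToDetach(asw, dsw map[key]bool) []key {
	var out []key
	for k := range asw {
		if !dsw[k] {
			out = append(out, k)
		}
	}
	sort.Slice(out, func(i, j int) bool {
		if out[i].node != out[j].node {
			return out[i].node < out[j].node
		}
		return out[i].volume < out[j].volume
	})
	return out
}

func main() {
	// vol-a is confirmed attached and still desired.
	// vol-b was re-added as uncertain from its VA object and is not desired.
	asw := map[key]bool{
		{"vol-a", "node-1"}: true,
		{"vol-b", "node-1"}: false,
	}
	dsw := map[key]bool{{"vol-a", "node-1"}: true}

	fmt.Println(volumesToDetach(asw, dsw)) // [{vol-b node-1}]
}
```

A pair that is in neither map never appears in the result, which is why the volume has to be present in the ASW, even as uncertain, for the detach to be retried.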

carlory · 2025-03-10T07:51:16Z

If the volume has already been removed from the node status, it will not trigger a node status update.

goushicui · 2025-03-10T08:06:34Z

@carlory
I see you keep emphasizing that populateActualStateOfWorld -> processVolumeAttachments will handle this logic. As I mentioned earlier, if the volume has been removed from node.status.volumesAttached, populateActualStateOfWorld executes as follows:

    for _, node := range nodes {
        nodeName := types.NodeName(node.Name)

        for _, attachedVolume := range node.Status.VolumesAttached {
            uniqueName := attachedVolume.Name
            // ...
        }
    }

First question: Won't this be added to the asw cache here?

    attachState := adc.actualStateOfWorld.GetAttachState(volumeName, nodeName)
    if attachState == cache.AttachStateDetached {
        // ...
    }

Second question: Can it be marked as Uncertain here?

Third question: If it is not marked as Uncertain, will reconcile continue to process this volumeattachment?

Could you please refer to the code and answer the above questions 1, 2, and 3 respectively? Thank you.

carlory · 2025-03-10T08:23:39Z

> First question: Won't this be added to the asw cache here?

It won't add the volume to the cache. If the volume can be found in the node status, the volume should be added to the ASW with its state set to attached.

> Second question: Can it be marked as Uncertain here?

Yes, it should be Uncertain. We don't know whether the detach operation was called. If the detach operation was called and failed due to a timeout, the volume may be detached. If not, the volume is attached. So its state is Uncertain. We cannot mark an attached volume as Uncertain if the volume is found in the node status, so we need this check.

> Third question: If it is not marked as Uncertain, will reconcile continue to process this volume attachment?

No. If it is not in the ASW and DSW, the VA won't be handled. The reconciler doesn't know the VA concept.

goushicui · 2025-03-10T08:32:13Z

> Yes, it should be Uncertain. We don't know whether the detach operation was called. If the detach operation was called and failed due to a timeout, the volume may be detached. If not, the volume is attached. So its state is Uncertain. We cannot mark an attached volume as Uncertain if the volume is found in the node status, so we need this check.

@carlory I am not asking whether it should be marked as Uncertain, but rather whether attachState := adc.actualStateOfWorld.GetAttachState(volumeName, nodeName) can be retrieved from the cache here. Can we proceed further?

goushicui · 2025-03-20T12:45:21Z

/assign jsafrane

goushicui · 2025-04-08T08:10:46Z

@yuga711

k8s-triage-robot · 2025-07-07T09:09:44Z

The Kubernetes project currently lacks enough contributors to adequately respond to all PRs.

This bot triages PRs according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the PR is closed

You can:

Mark this PR as fresh with /remove-lifecycle stale
Close this PR with /close
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

k8s-ci-robot · 2025-07-12T13:12:08Z

PR needs rebase.

k8s-triage-robot · 2025-08-11T13:57:47Z

The Kubernetes project currently lacks enough active contributors to adequately respond to all PRs.

This bot triages PRs according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the PR is closed

You can:

Mark this PR as fresh with /remove-lifecycle rotten
Close this PR with /close
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle rotten

During the execution of DetachVolume, if the kube-controller restarts, it may result in a volumeAttachment leak (9479aef)

k8s-ci-robot added the needs-ok-to-test label Mar 1, 2025

github-project-automation bot added this to SIG Apps Mar 1, 2025

github-project-automation bot moved this to Needs Triage in SIG Apps Mar 1, 2025

k8s-ci-robot requested review from gnufied and mauriciopoppe March 1, 2025 16:42

vahan-sahakyan-op approved these changes Mar 1, 2025

k8s-ci-robot assigned gnufied Mar 2, 2025

k8s-ci-robot removed the request for review from mauriciopoppe March 3, 2025 15:31

k8s-ci-robot assigned thockin Mar 7, 2025

k8s-ci-robot added ok-to-test and removed needs-ok-to-test labels Mar 7, 2025

k8s-ci-robot assigned jsafrane Mar 20, 2025

k8s-ci-robot added lifecycle/stale and needs-rebase labels Jul 7, 2025

k8s-ci-robot added lifecycle/rotten and removed lifecycle/stale labels Aug 11, 2025
