Use force unmount and explicitly unmount bad mount points #183

dabradley · 2024-10-16T12:04:52Z

There have been cases where the logic to cleanup a mount point has caused the driver to get into a bad state. This is most obvious when a subdirectory is mounted to a volume and a parent directory of that subdirectory is deleted. The Lustre driver doesn't handle that case in the way that Kubernetes expects and returns invalid data. To avoid this scenario causing our driver to get into a bad state, leak mount points, etc, we must explicitly check that we can read the necessary information about the mount point, and if not, explicitly unmount that mount point before allowing Kubernetes to clean up the directory. To ensure that we don't end up in a bad state, this change enables force unmounting as well. The force unmount will only occur after a timeout has expired, since force unmounts can cause issues with the Lustre driver. However, in this case, it is better if we are in a bad enough situation to be able to eventually return to a good state rather than require manual intervention.

What type of PR is this?

/kind bug

There have been cases where the logic to cleanup a mount point has caused the driver to get into a bad state. This is most obvious when a subdirectory is mounted to a volume and a parent directory of that subdirectory is deleted. The Lustre driver doesn't handle that case in the way that Kubernetes expects and returns invalid data. To avoid this scenario causing our driver to get into a bad state, leak mount points, etc, we must explicitly check that we can read the necessary information about the mount point, and if not, explicitly unmount that mount point before allowing Kubernetes to clean up the directory. To ensure that we don't end up in a bad state, this change enables force unmounting as well. The force unmount will only occur after a timeout has expired, since force unmounts can cause issues with the Lustre driver. However, in this case, it is better if we are in a bad enough situation to be able to eventually return to a good state rather than require manual intervention.

k8s-ci-robot · 2024-10-16T12:05:07Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: dabradley

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [dabradley]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

coveralls · 2024-10-16T12:11:55Z

coverage: 81.844% (-0.9%) from 82.773%
when pulling 9297cb4 on dabradley:personal/dabradley/cleanupbadmounts
into d583bcd on kubernetes-sigs:development.

dabradley requested a review from t-mialve October 16, 2024 12:04

k8s-ci-robot added do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. kind/bug Categorizes issue or PR as related to a bug. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. labels Oct 16, 2024

k8s-ci-robot requested review from andyzhangx and vinli-cn October 16, 2024 12:05

k8s-ci-robot added size/M Denotes a PR that changes 30-99 lines, ignoring generated files. approved Indicates a PR has been approved by an approver from all required OWNERS files. labels Oct 16, 2024

dabradley removed request for andyzhangx and vinli-cn October 16, 2024 12:08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use force unmount and explicitly unmount bad mount points #183

Use force unmount and explicitly unmount bad mount points #183

dabradley commented Oct 16, 2024

k8s-ci-robot commented Oct 16, 2024

coveralls commented Oct 16, 2024

Use force unmount and explicitly unmount bad mount points #183

Are you sure you want to change the base?

Use force unmount and explicitly unmount bad mount points #183

Conversation

dabradley commented Oct 16, 2024

k8s-ci-robot commented Oct 16, 2024

coveralls commented Oct 16, 2024