Node Maintenance Guideline
This document (000021085) is provided subject to the disclaimer at the end of this document.
Situation
Resolution
These are the steps needed to shut down all volumes without having to modify the scale of the end user/application deployments in the cluster.
- Cordon the node. Longhorn will automatically disable the node scheduling when a Kubernetes node is cordoned.
- Drain the node to move the workload to somewhere else. You will need to use any of the following documentation for the respective commands depending on the version:
- The replica processes on the node will be stopped at this stage. Replicas on the node will be shown as
Failed
.
Note: By default, if there is one last healthy replica for a volume on the node, Longhorn will prevent the node from completing the drain operation, to protect the last replica and prevent the disruption of the workload. You can either override the behavior in the setting, or evict the replica to other nodes before draining.
- The engine processes on the node will be migrated with the Pod to other nodes.
Note: If there are Longhorn volumes that are manually attached by Longhorn UI on the node, Longhorn will prevent the node from completing the drain operation, please detach these volume using Longhorn UI.
- After the
drain
is completed, there should be no engine or replica process running on the node. Two instance managers will still be running on the node, but they’re stateless and won’t cause interruption to the existing workload. - Perform the necessary maintenance, including shutting down or rebooting the node.
- Uncordon the node. Longhorn will automatically re-enable the node scheduling.
Important note 2: Always refer to the documentation of the specific product version. Many features and functions are changed so for the most accurate information, you have to ensure you're looking into the right version documentation.
Additional Information
- Cordoning a Node - SUSE Rancher documentation
- Draining a Node - SUSE Rancher documentation
- Safely Drain a Node - Kubernetes documentation
- Node Mantenance - Longhorn
Disclaimer
This Support Knowledgebase provides a valuable tool for SUSE customers and parties interested in our products and solutions to acquire information, ideas and learn from one another. Materials are provided for informational, personal or non-commercial use within your organization and are presented "AS IS" WITHOUT WARRANTY OF ANY KIND.
- Document ID:000021085
- Creation Date: 25-May-2023
- Modified Date:02-Nov-2023
-
- SUSE Rancher Longhorn
For questions or concerns with the SUSE Knowledgebase please contact: tidfeedback[at]suse.com