Node Maintenance Guideline

This document (000021085) is provided subject to the disclaimer at the end of this document.

Situation

This article describes how to handle planned maintenance of a Kubernetes node in a cluster that uses Longhorn.

Resolution

Follow these steps to shut down all volumes on the node without changing the scale of the end-user or application deployments in the cluster.

  • Cordon the node. Longhorn will automatically disable the node scheduling when a Kubernetes node is cordoned.
  • Drain the node to move the workload elsewhere. Use the drain command from the documentation that matches your Longhorn version:
    • For Longhorn before 1.4.1 (doc)
    • For Longhorn 1.4.3+ (doc) - drain command has been simplified
This will evict the workloads from the draining node.
  • The replica processes on the node will be stopped at this stage. Replicas on the node will be shown as Failed.
Note: By default, if the node holds the last healthy replica of a volume, Longhorn prevents the drain from completing, to protect that replica and avoid disrupting the workload. You can either override this behavior in the Longhorn settings, or evict the replica to another node before draining.
  • The engine processes on the node will be migrated with the Pod to other nodes. 
Note: If there are Longhorn volumes on the node that were attached manually through the Longhorn UI, Longhorn will prevent the drain from completing. Detach these volumes using the Longhorn UI.
  • After the drain is completed, there should be no engine or replica process running on the node. Two instance managers will still be running on the node, but they’re stateless and won’t cause interruption to the existing workload.
  • Perform the necessary maintenance, including shutting down or rebooting the node.
  • Uncordon the node. Longhorn will automatically re-enable the node scheduling.
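The cordon, drain, and uncordon steps above can be sketched as a small Python helper that builds the kubectl commands in order. This is a minimal sketch, not the official procedure: the node name worker-1 and the helper names are hypothetical, and the drain flags shown are only a starting point. Take the exact drain command from the Longhorn documentation for your version.

```python
import shlex
import subprocess


def cordon_and_drain(node, extra_drain_flags=()):
    """Commands to run before maintenance: cordon first, then drain."""
    return [
        ["kubectl", "cordon", node],
        # --ignore-daemonsets lets the drain proceed past DaemonSet-managed pods.
        # Assumption: any further flags come from the docs for your Longhorn version.
        ["kubectl", "drain", node, "--ignore-daemonsets", *extra_drain_flags],
    ]


def uncordon(node):
    """Command to run once maintenance is finished."""
    return [["kubectl", "uncordon", node]]


def run_all(commands, dry_run=True):
    """Print each command; also execute it when dry_run is False."""
    for argv in commands:
        print(shlex.join(argv))
        if not dry_run:
            # check=True stops the sequence if a step fails (e.g. a blocked drain)
            subprocess.run(argv, check=True)


if __name__ == "__main__":
    node = "worker-1"  # hypothetical node name
    run_all(cordon_and_drain(node))
    # ... perform the maintenance here, then wait for the node to be Ready ...
    run_all(uncordon(node))
```

With the default dry_run=True the helper only prints the commands, which makes it easy to review them before running anything against a cluster. If a drain is blocked, for example by the last healthy replica of a volume, kubectl exits with an error and check=True stops the sequence before the node is taken down.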
Important note 1: Always refer to the respective product documentation for the exact steps and details. This KB article only points to the main aspects of the issue being addressed.
Important note 2: Always refer to the documentation for the specific product version you are running. Features and functions change between versions, so make sure you are reading the documentation for the right version.
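The note about the last healthy replica can be illustrated with a short pre-drain check. This is a sketch of the idea only: the record layout (volume, node, healthy) is hypothetical and is not the Longhorn API, and the data would have to be collected from the Longhorn UI or resources by other means.

```python
from collections import defaultdict


def volumes_blocking_drain(replicas, node):
    """Return volumes whose only healthy replicas live on `node`."""
    healthy_nodes = defaultdict(set)
    for r in replicas:
        if r["healthy"]:
            healthy_nodes[r["volume"]].add(r["node"])
    # A volume blocks the drain when every healthy replica is on this node.
    return sorted(v for v, nodes in healthy_nodes.items() if nodes == {node})


# Hypothetical replica records for illustration
replicas = [
    {"volume": "pvc-a", "node": "worker-1", "healthy": True},
    {"volume": "pvc-a", "node": "worker-2", "healthy": True},
    {"volume": "pvc-b", "node": "worker-1", "healthy": True},
    {"volume": "pvc-b", "node": "worker-3", "healthy": False},
]
print(volumes_blocking_drain(replicas, "worker-1"))  # prints ['pvc-b']
```

In this example pvc-a has a second healthy replica on worker-2, so draining worker-1 is safe for it, while pvc-b would need its replica evicted or rebuilt elsewhere first.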

Disclaimer

This Support Knowledgebase provides a valuable tool for SUSE customers and parties interested in our products and solutions to acquire information, ideas and learn from one another. Materials are provided for informational, personal or non-commercial use within your organization and are presented "AS IS" WITHOUT WARRANTY OF ANY KIND.

  • Document ID: 000021085
  • Creation Date: 25-May-2023
  • Modified Date: 02-Nov-2023
    • SUSE Rancher Longhorn

For questions or concerns with the SUSE Knowledgebase please contact: tidfeedback[at]suse.com
