Simplified access to the NVIDIA CUDA toolkit on SUSE Linux for HPC
Overview
The High-Performance Computing industry is rapidly embracing the use of AI and ML technology in addition to legacy parallel computing. Heterogeneous Computing, the use of both CPUs and accelerators like graphics processing units (GPUs), has become increasingly more common and GPUs from NVIDIA are the most popular accelerators used today for AI/ML workloads. To get the full advantage of NVIDIA GPUs, you need to use the CUDA parallel computing platform and programming toolkit. The CUDA Toolkit includes GPU-accelerated libraries, a compiler, development tools and the CUDA runtime. To get the full advantage of NVIDIA GPUs, you need to use NVIDIA CUDA, which is a general purpose parallel computing platform and programming model for NVIDIA GPUs. The NVIDIA CUDA Toolkit includes GPU-accelerated libraries, a compiler, development tools and the CUDA runtime. CUDA supports the SUSE Linux operating system distributions (both SUSE Enterprise and OpenSUSE) and NVIDIA provides a repository with the necessary packages to easily install the CUDA Toolkit and NVIDIA drivers on SUSE. To simplify installation of NVIDIA CUDA Toolkit on SUSE Linux Enterprise for High Performance Computing (SLE HPC) 15, we have included a new SUSE Module, NVIDIA Compute Module 15. This Module adds the NVIDIA CUDA network repository to your SLE HPC system. You can select it at installation time or activate it post installation. This module is available for use with all SLE HPC 15 Service Packs. Note that the NVIDIA Compute Module 15 is currently only available for the SLE HPC 15 product.Post-Installation Process via Yast
Installing via Yast- Start Yast and select System Extensions
- After YaST checks the registration for the system, a list of modules that are installed or available is displayed.
- Information on the EULA for the CUDA drivers is displayed.
- You must trust the GnuPG key for the CUDA repository.
- You will be given one more confirmation screen
- After adding the repository, you can install the CUDA drivers.
- A large number of packages will be installed
Summary
Managing heterogeneous computing environments has become increasingly important for HPC and AI/ML administrators. The NVIDIA Compute Module is one way we are working to make using these technologies easier to use.(Visited 37 times, 1 visits today)
Related Articles
Dec 12th, 2023
Intel® TDX Support Coming to SUSE Linux Enterprise Server
Jan 19th, 2024
Security Controls for the OWASP Kubernetes Top 10
Aug 29th, 2024
No comments yet