Pre-installed Modules¶
Most software available on Cheaha is installed as modules, managed by the Lmod system. This document will provide a basic rundown of using Lmod commands to customize a software environment. module
is the main command used to interface with module files in Lmod.
As of the most recent update of this page there are 4,445 active modules installed on Cheaha. The most commonly used general research software modules are listed in the table below. Read on to learn more about searching for and loading modules. If you can't find what you need in our modules, learn more about getting software installed. If you need further assistance, please contact Support.
Name | Description |
---|---|
Anaconda3 | Software that can install the Python language, Python packages, and other research software. Learn more about using Anaconda at our Anaconda on Cheaha page. You may be interested in our OpenOnDemand Jupyter Notebook interactive app. |
CUDA, cuDNN | Libraries for developing and using deep learning and AI models with NVidia GPUs. Commonly used with TensorFlow and PyTorch. See our GPU page for more information. |
Mathematica | Mathematical CAS and numerical computing software. Try our Open OnDemand HPC Desktop interactive app. |
Matlab | Matlab language and development environment. We recommend using our Open OnDemand Matlab interactive app. |
R, Rstudio | R language and RStudio IDE. We recommend using our Open OnDemand RStudio interactive app. |
SAS | Statistical analysis software. Try our Open OnDemand HPC Desktop interactive app. |
Singularity | Software container engine. See our Containers page for more information. |
Stata | Statistical analysis software. Try our Open OnDemand HPC Desktop interactive app. |
Listing and Searching Modules¶
To begin, all module commands are run from the terminal. To know what software is installed on Cheaha, use the avail
command.
If you need to know what software is already loaded in your environment, run:
If there is specific software you want to search for, you can use the spider
subcommand, and provide a string or regular expression to match against. All modules containing the string (case-insensitive) or matching the regular expression will be returned along with their installed versions.
# list modules containing string
module spider <string>
# list modules matching a regular expression
module -r spider <regex>
Loading Modules¶
To load modules, run:
Note
If you only specify a module name without an accompanying version tag, the most recently installed version will be loaded into the workspace. If your scripts depend on specific versions of software being used, explicitly load the module version you need.
To unload packages, run:
If you want to revert to the default modules, you can use:
Saving Modules using Collections¶
To save time in typing in long list of modules everytime you work on a project, you can save the desired list of modules using module collection. To acheive this, load the desired modules and save them to a collection using a module collection name, as shown below.
Here, the collection_name
can be something relevant to your project and easy to remember.
To load the desired modules using the saved collection use,
To delete a collection use the below command,
To list the save list of module collection use,
Warning
Using module save
command without a collection name saves the desired modules in the name default
to the location $HOME/.lmod.d/default, and causes issue in launching Open On Demand (OOD) HPC desktop job. The user gets a VNC error such as, Unable to contact settings server
and/or Unable to load a failsafe session
. To address this issue, it is recommended to follow the instructions outlined in the FAQ entry.
Best Practice for Loading Modules¶
When using modules in Cheaha, we recommend users to follow these best practices to avoid any potential module conflicts, reduce unexpected behavior and/or to get rid of Lmod errors:
-
Avoid using
module load
in$HOME/.bashrc
. Instead, create a bash script with the module load commands and source it each time to load the modules needed in a shell/sbatch script. Here is an example of loading module in a bash script namedmodule_test.sh
and compilation,#!/bin/bash module reset module load Bowtie/1.1.2-foss-2016a module load SAMtools/1.3.1-foss-2016a module load TopHat/2.1.1-foss-2016a module -t list
$ chmod +x module_test.sh $ source ./module_test.sh Resetting modules to system default # Currently Loaded Modules shared slurm/18.08.9 rc-base DefaultModules GCCcore/4.9.3 binutils/2.25-GCCcore-4.9.3 GCC/4.9.3-2.25 numactl/2.0.11-GCC-4.9.3-2.25 hwloc/1.11.2-GCC-4.9.3-2.25 OpenMPI/1.10.2-GCC-4.9.3-2.25 OpenBLAS/0.2.15-GCC-4.9.3-2.25-LAPACK-3.6.0 gompi/2016a FFTW/3.3.4-gompi-2016a ScaLAPACK/2.0.2-gompi-2016a-OpenBLAS-0.2.15-LAPACK-3.6.0 foss/2016a Bowtie/1.1.2-foss-2016a ncurses/6.0-foss-2016a zlib/1.2.8-foss-2016a SAMtools/1.3.1-foss-2016a bzip2/1.0.6-foss-2016a Boost/1.61.0-foss-2016a TopHat/2.1.1-foss-2016a
-
Be selective and only load a specific module version that you need for your current workflow. Loading unnecessary modules can lead to conflicts and inefficiencies.
- Before loading modules in a shell/bash/sbatch script, use a clean shell by using
module reset
at the beginning.- What it does:
- Clearing loaded modules.
- Loading default modules specified by the system administrator.
- What it prevents from happening:
- Module conflicts.
- Why it is a best-practice:
- Ensures reproducibility by starting with a clean environment.
- Manages software dependencies effectively.
- What it does:
Using module reset
before loading modules separates what software is loaded in the working shell from the software loaded in the script shell. Be aware that forked processes (like scripts) and Slurm commands inherit the environment variables of the working shell, including loaded modules. Here is an example that shows module conflict between cuda11.8 and cuda11.4 versions that may lead to unexpected behavior or an erroneous output.
# Working shell where you may try testing module load and your run script
$ module load cuda11.4/toolkit
$ module -t list
#Currently Loaded Modules
shared
slurm/18.08.9
rc-base
DefaultModules
cuda11.4/toolkit/11.4.2
# bash script you are passing in a sbatch script
#!/bin/bash
module load cuda11.8/toolkit
module -t list
# Not using `module reset` at the beginning of the bash script could cause CUDA conflict issues.
$ source ./module_test2.sh
#Currently Loaded Modules
shared
slurm/18.08.9
rc-base
DefaultModules
cuda11.4/toolkit/11.4.2
cuda11.8/toolkit/11.8.0
Note
The best practice would be to avoid using module reset
in the Environment Setup
of Open OnDemand jobs as the OOD session, by default, resets the module at the beginning of every session. It is observed to cause unexpected behavior if module reset
is used in the Rstudio server OOD sessions.
Licensed and Commercial Software Restrictions¶
The following software have license restrictions that may preclude some researchers or collaborators depending on their departmental or group affiliations. In the table, "affiliated" means employed by, or a student of, unless otherwise noted. External collaborators are not considered affiliated with UAB for the purposes of software licensing and access, unless otherwise noted. These software packages may be commercial paid software. If you believe you should have access to software that you do not have access to, please contact Support.
Software | Restricted to... | License Holder |
---|---|---|
Amber | Lab Specific | Lab PI |
Ansys | School of Engineering affiliated people | School of Engineering |
Gurobi | Named individual | Individuals |
LS-Dyna | School of Engineering affiliated people | School of Engineering |
Matlab | UAB affiliated people | UAB Campus |
Mathematica | UAB affiliated people | UAB Campus |
SAS | UAB affiliated people | UAB Campus |
Stata | UAB affiliated people | UAB Campus |
Use of these software packages without authorization may be a violation of the UAB IT Acceptable Use Policy.
Security Issues¶
IGV¶
Danger
Versions of IGV prior to 2.11.9
use a compromised version of log4j. Those versions are affected by a serious remote code execution issue. Please transition your software to use versions of IGV >= 2.11.9
.
GSEA¶
Danger
Versions of GSEA prior to 4.2.3
use a compromised version of log4j. Those versions are affected by a serious remote code execution issue. Please transition your software to use versions of GSEA >= 4.2.3
.
Known Issues¶
Matlab Issues¶
There is a critical, hard-to-diagnose MATLAB parpool bug in versions before R2022a.
The issue arises when using a parpool
for multiple jobs simultaneously, as with an sbatch --array
job. MATLAB parpool
can be started manually, or at the first parfor
loop encountered, among other functionality. See the MATLAB Documentation for more information and a complete list.
Before R2022a, MATLAB assumed that only one parpool will be used at a time for each user, and put necessary communication files in a common directory. When multiple parpools are run simultaneously by the same user, they may attempt to write to those files at the same time, corrupting the files, resulting in a range of obscure Parallel Computing Toolbox (PCT) errors. The collisions are effectively random, which can make the issue hard to reproduce and hard to diagnose. The more parpools open simultaneously, the more likely there will be at least one error. In the worst case, we have seen unrecoverable corruption of the parpool common directory, which can be fixed by deleting the directory.
Symptoms of the bug include:
- Excessive load and context switching on affected nodes
- Inconsistent and varied PCT errors
- Inability to start Matlab parpool
To avoid the bug, please use the latest available version of MATLAB and no earlier than R2022a. Upgrading MATLAB versions may require some effort and testing of your code, because MATLAB is not always backwards compatible. Be sure to test that your code works as expected on the new version before using it for research.
If you aren't able to use R2022a or newer, there is a workaround available. Please navigate to this GitHub repository and follow the instructions in README.md
. Some light MATLAB programming is required to effectively use the workaround. Please contact Support if you would like assistance.