Machines

Polaris attempts to be aware of the capabilities of the machine it is running on. This is a particular advantage for so-called “supported” machines with a config file defined for them in the polaris package. But even for “unknown” machines, it is not difficult to set a few config options in your user config file to describe your machine. Then, polaris can use this data to make sure test cases are configured in a way that is appropriate for your machine.

Supported Machines

If you follow the procedure in polaris conda environment, spack environment, compilers and system modules, you will have an activation script for activating the development conda environment, setting loading system modules and setting environment variables so you can build Omega or an MPAS component and work with polaris. Just source the script that should appear in the base of your polaris branch, e.g.:

source load_dev_polaris_0.1.0-alpha.1_anvil_intel_impi.sh

After loading this environment, you can set up tasks or suites, and a link load_polaris_env.sh will be included in each suite or task work directory. This is a link to the activation script that you sourced when you were setting things up. You can can source this file on a compute node (e.g. in a job script) to get the right polaris conda environment, compilers, MPI libraries and environment variables for running polaris tasks and the MPAS model.

Below are specifics for each supported machine

MPAS-Ocean and -Seaice Supported Machines

These are the machines supported by MPAS-Ocean and -Seaice, including the “make target” used to build the MPAS component.

Machine

Compiler

MPI lib.

MPAS make target

anvil

intel

impi

intel-mpi

openmpi

ifort

gnu

openmpi

gfortran

chicoma-cpu

gnu

mpich

gnu-cray

chrysalis

intel

openmpi

ifort

gnu

openmpi

gfortran

compy

intel

impi

intel-mpi

frontier

gnu

mpich

gnu-cray

crayclang

mpich

cray-cray

pm-cpu

gnu

mpich

gnu-cray

intel

mpich

intel-cray

Omega Supported Machines

These are the machines supported by Omega. The MPI library is always the E3SM default for the given machine an compiler.

Machine

Compiler

MPI lib.

chicoma-cpu

gnu

mpich

chrysalis

intel

openmpi

gnu

openmpi

frontier

gnu

mpich

gnugpu

mpich

amdclang

mpich

amdclanggpu

mpich

crayclang

mpich

crayclanggpu

mpich

pm-cpu

gnu

mpich

intel

mpich

nvidia

mpich

pm-gpu

gnugpu

mpich

nvidiagpu

mpich

Note

Omega does not currently support Compy and Anvil.

Other Machines

If you are working on an “unknown” machine, the procedure is pretty similar to what was described in polaris conda environment, spack environment, compilers and system modules. The main difference is that we will use mpich or openmpi and the gnu compilers from conda-forge rather than system compilers. To create a development conda environment and an activation script for it, on Linux, run:

./configure_polaris_envs.py --conda <conda_path> -c gnu -i mpich

and on OSX run:

./configure_polaris_envs.py --conda <conda_path> -c clang -i mpich

You may use openmpi instead of mpich but we have had better experiences with the latter.

The result should be an activation script load_dev_polaris_0.1.0-alpha.1_<mpi>.sh. Source this script to get the appropriate conda environment and environment variables.

Under Linux, you can build the MPAS model with

make gfortran

Under OSX, you can build the MPAS model with

make gfortran-clang

Adding a New Supported Machine

If you want to add a new supported machine, you

Adding a Machine Config File

The first step in adding a new supported machine to add a config file in polaris/machines. The config file needs to describe the parallel environment and some paths where shared Spack environments will be installed and shared data will be downloaded. The easiest place to start is one of the examples provided (machine morpheus for now, but more will be added soon.)

# The parallel section describes options related to running jobs in parallel
[parallel]

# parallel system of execution: slurm, cobalt or single_node
system = single_node

# whether to use mpirun or srun to run a task
parallel_executable = mpirun

# cores per node on the machine
cores_per_node = 8


# Config options related to spack environments
[spack]

# whether to load modules from the spack yaml file before loading the spack
# environment
modules_before = False

# whether to load modules from the spack yaml file after loading the spack
# environment
modules_after = False


# The paths section describes paths that are used within the ocean core test
# cases.
[paths]

# A shared root directory where MPAS standalone data can be found
database_root = /home/xylar/data/mpas/mpas_standalonedata

# the path to the base conda environment where polaris environments have
# been created
polaris_envs = /home/xylar/data/mpas/polaris_envs


# Options related to deploying a polaris conda environment on supported
# machines
[deploy]

# the compiler set to use for system libraries and MPAS builds
compiler = gnu

# the system MPI library to use for gnu compiler
mpi_gnu = openmpi

# the base path for spack environments used by polaris
spack = /home/xylar/data/mpas/spack

# whether to use the same modules for hdf5, netcdf-c, netcdf-fortran and
# pnetcdf as E3SM (spack modules are used otherwise)
use_e3sm_hdf5_netcdf = False


# Options related to machine discovery
[discovery]

# a substring used to identify this machine from its hostname
hostname_contains = morpheus

The [parallel] section should describe the type of parallel queuing system (currently only slurm or single_node are supported), the number of cores per node and the command for running an MPI executable (typically srun for Slurm and mpirun for a “single node” machine like a laptop or workstation.

The [spack] section has some config options to do with loading system modules before or after loading a Spack environment. On a “single node” machine, you typically don’t have modules so both modules_before and modules_after can be set to False. On a high-performance computing (HPC) machine, you may find it is safest to load modules after the Spack environment to ensure that certain paths and environment variables are set the way the modules have them, rather than the way that Spack would have them. The recommended starting point would be modules_before = False and modules_after = True, but could be adjusted as needed if the right shared libraries aren’t being found when you try to build an MPAS component.

In the [paths] section, you will first give a path where you would like to store shared data files used in polaris tasks in database_root. Polaris will create this directory if it doesn’t exist. Then, you can specify polaris_envs as a path where shared conda environments will be installed for polaris releases. If developers always create their own conda environments, this path will never be used.

In [deploy], you will specify config options used in setting up conda and Spack environments for developers. The compiler is the default compiler to use for your system. You must supply a corresponding mpi_<compiler> for each supported compiler (not just the default compiler) that specifies the default MPI library for that compiler. If you only support one compiler and MPI library, that’s pretty simple: compiler is the name of the compiler (e.g. intel or gnu) and mpi_<compiler> is the MPI library (e.g. compiler_gnu = mpich or compiler_intel = openmpi). The spack option specifies a path where Spack environment will be created. The option use_e3sm_hdf5_netcdf = False indicates that you will not use the E3SM default modules for HDF5 and NetCDF libraries (which are not available for machines installed in the way described here).

Finally, [discovery] allows you to add a hostname_contains that is used to automatically identify your machine based on its hostname. If your machine has multiple login nodes with different hostnames, hopefully, a string common to all login nodes can be used here. If your machine has a unique hostname, simply give that. This option saves developers from having to specify --machine <machine> each time they setup polaris environments or test cases.

Describing a Spack Environment

The next step is to create a template YAML file that can be used to create Spack environments for your machine. Polaris uses Spack environments to build packages that need MPI support or which should be build for some other reason with system compilers rather than coming from pre-built conda packages. Using a Spack environment allows these packages to be built together in a consistent way that is not guaranteed if you try to install dependencies one-by-one. In Spack parlance, this is known as unified concretization.

To do this, you will create a file deploy/spack/<machine>_<compiler>_<mpi>.yaml similar to the following example for an Ubuntu laptop:

spack:
  specs:
  - gcc
  - openmpi
{{ specs }}
  concretizer:
    unify: true
  packages:
    all:
      compiler: [gcc@11.3.0]
      providers:
        mpi: [openmpi]
    curl:
      externals:
      - spec:  curl@7.81.0
        prefix: /usr
      buildable: false
    gcc:
      externals:
      - spec: gcc@11.3.0
        prefix: /usr
      buildable: false
  config:
    install_missing_compilers: false
  compilers:
  - compiler:
      spec: gcc@11.3.0
      paths:
        cc: /usr/bin/gcc
        cxx: /usr/bin/g++
        f77: /usr/bin/gfortran
        fc: /usr/bin/gfortran
      flags: {}
      operating_system: ubuntu22.04
      target: x86_64
      modules: []
      environment: {}
      extra_rpaths: []

Typically your system will already have compilers if nothing else, and this is what we assume here. Give the appropriate path (replace /usr with the appropriate path on your system). We have had better luck with gcc than other compilers like Intel so far so for new supported machines so that’s our recommendation. Use gcc --version to determine the version and replace 11.3.0 with this number.

Finally, you might need to update the target and operating_system. This is a bit of a “catch 22” in that you can use Spack to find this out but polaris is designed to clone and set up Spack for you so we assume you don’t have it yet. For now, make your best guess using the info on this page and correct it later if necessary.

You may need to load a system module to get the compilers and potentially other libraries such as MPI, HDF5, and NetCDF-C if you prefer to use system modules rather than having Spack build them. If this is the case, the best way to do this is to add a file conda/spack/<machine>_<compiler>_<mpi>.sh along these lines:

module purge
module load perl/5.32.0-bsnc6lt
module load gcc/9.2.0-ugetvbp
module load openmpi/4.1.3-sxfyy4k
module load intel-mkl/2020.4.304-n3b5fye
module load hdf5/1.10.7-j3zxncu
module load netcdf-c/4.4.1-7ohuiwq
module load netcdf-fortran/4.4.4-k2zu3y5
module load parallel-netcdf/1.11.0-mirrcz7

These modules will be loaded either before or after the spack environment, depending on the modules_before and modules_after config options above. You can also add modules in your YAML file but this shouldn’t be necessary.

For examples from various supported machines, compilers and MPI libraries, see the mache spack directory.

Building the Spack Environment

The next step is to try setting up polaris and asking it to build the Spack environment with a command something like:

  ./configure_polaris_envs.py --verbose --update_spack --conda <conda_path> -c gnu -i openmpi ...

The --update_spack flag tells polaris to create (or update) a Spack environment. You can specify a directory for testing Spack with the --spack flag. You can specify a temporary directory for building spack packages with --tmpdir (this directory must already exist). This is useful if your /tmp space is small (Spack will use several GB of temporary space).

Creating the Spack environment may take anywhere from minutes to hours, depending on your system.

If dependencies don’t build as expected, you may get an error message suggesting that your operating_system or target aren’t right. Here’s an example:

==> Error: concretization failed for the following reasons:
   1. zlib compiler '%gcc@9.4.0' incompatible with 'os=ubuntu20.04'
   2. readline compiler '%gcc@9.4.0' incompatible with 'os=ubuntu20.04'
   3. pkgconf compiler '%gcc@9.4.0' incompatible with 'os=ubuntu20.04'

In this example, I had specified operating_system: ubuntu22.04 in the YAML file but in fact my operating system is ubuntu20.04 as shown in the error message.

You can run:

source $SPACKDIR/share/spack/setup-env.sh
spack arch -o
spack arch -g

where $SPACKDIR is the directory where the Spack repository was cloned by polaris (you should see Cloning into <$SPACKDIR> in the terminal, which will hopefully help you find the right directory). This should hopefully give you something close to what Spack wants. If you get something like x86_64_v4 for the target, use x86_64 instead.

If you are getting other error messages, do your best to debug them but also feel free to get in touch with the polaris development team and we’ll help if we can.

If you get everything working well, please feel free to make a pull request into the polaris main repo to add your supported machine.