2.2. Building and Running HAFS

This chapter walks users through a basic experiment using HAFS. Data comes from Hurricane Laura (2020-08-25 12z) and is already available on disk on supported systems.

Currently, the HAFS application works on these NOAA HPC platforms:

wcoss_dell_p3
wcoss_cray
hera
jet
orion

2.2.1. Set Up the Directory Structure

Note

Once this setup has been performed, it does not need to be repeated for successive experiments.

HAFS is configured to use a particular directory structure for operations, and running HAFS will therefore go more smoothly when users set up a similar directory structure for running HAFS on their own system. On NOAA RDHPC systems, users typically have disk space under a particular project. Users on any system can save their project directory in an environtment variable to reference later:

export PROJECT=/path/to/project/directory

Concretely, a user on Hera with disk space under the epic project might run:

export PROJECT=/scratch2/NAGAPE/epic

Within a directory where the user has write access (e.g., the $PROJECT directory), create the following directories:

mkdir $PROJECT/save
mkdir $PROJECT/noscrub

Within each of these directories, create a directory with your username. For example:

mkdir $PROJECT/save/Jane.Doe
mkdir $PROJECT/noscrub/Jane.Doe

Within noscrub/<username two more directories:

cd $PROJECT/noscrub/Jane.Doe
mkdir save && cd save
mkdir HAFS

The resulting directory should look similar to /scratch2/NAGAPE/epic/noscrub/Jane.Doe/save/HAFS.

2.2.2. Clone and Checkout the Repository

cd $PROJECT/save/<username>
git clone -b <BRANCH> --recursive https://github.com/hafs-community/HAFS.git

Select the branch to clone by setting the <BRANCH> option to the branch name. In general, the production/hafs.v# branch with the highest version number will be up-to-date with code for the most recent or upcoming release. The feature/ branches contains code that is in development by various HAFS developers. This code is typically merged to develop and/or to the upcoming production/hafs.v# branch when it is ready.

Note

develop is the default branch; the develop branch will be cloned if no branch is specified.

2.2.3. Build and Install HAFS

The install_hafs.sh script builds HAFS by calling several other scripts with distinct functions:

machine-setup.sh Determines shell, identifies machine, and loads modules

build_all.sh Compiles components: forecast, post-processing, tracker, utils, tools, hycom, ww3, and gsi

install_all.sh Copies executables to exec directory

link_fix.sh Links fix files (fix files are available on disk on supported platforms)

To build HAFS, navigate to sorc and run the installation script:

cd $PROJECT/save/<username>/HAFS/sorc
./install_hafs.sh > install_hafs.log 2>&1

Note

Building HAFS can take a while, potentially upwards of an hour. To see output printed to the console as HAFS builds, users can omit > install_hafs.log 2>&1 when running install_hafs.sh.

Once install_hafs.sh has run, install_hafs.log should appear in the sorc directory. Users can also check the log files in the HAFS/sorc/logs directory to see if the build was successful or if there were any errors. A successful build should result in a build_*.log file for each executable:

build_forecast.log

build_gsi.log

build_hycom_utils.log

build_post.log

build_tools.log

build_tracker.log

build_utils.log

build_ww3_utils.log

Additionally, several executables should appear in a new HAFS/exec directory. These executables include:

hafs_forecast_*.x

hafs_gsi_enkf.x

hafs_gsi.x

hafs_hycom_utils_*.x

hafs_post.x

hafs_tools_*.x

hafs_tracker_*.x

hafs_utils_*.x

hafs_ww3_*.x

Hint

Got errors? Look into the HAFS/sorc/logs directory.

2.2.4. Run the HAFS System

Before beginning, it may be useful to save the path to the HAFS directory in an environtment variable to reference later:

export HAFS=$PROJECT/save/<username>/HAFS/sorc

Concretely, a user on Hera with disk space under the epic project might run:

export HAFS="/scratch2/NAGAPE/epic/save/Jane.Doe/HAFS"

The remaining documentation will use the environment variable $HAFS to refer to an equivalent location. Users will need to reset this environment variable for every login session. Users may enter the path to this directory instead of using an environment variable.

2.2.4.1. Edit system.conf

To configure an experiment, run:

cd $HAFS/parm
cp system.conf.<system> system.conf
vi system.conf

where <system> is replaced by the name of one of the supported platforms listed above.

Edit the following:

disk_project: Project name for disk space.

tape_project (optional): HPSS project name.

cpu_account: CPU account name for submitting jobs to the batch system (may be the same as disk_project)

archive=disk: Archive location (make sure you have write permission) * e.g., /scratch2/NAGAPE/epic/save/Jane.Doe

CDSAVE: HAFS parent directory

CDNOSCRUB: Track files will be copied to this location — contents will not be scrubbed (user must have write permission)

CDSCRUB If scrub is set to yes, this directory will be removed (user must have write permission)

For example, an edited system.conf file on Hera might resemble the following for an imaginary user Jane Doe:

## This is the system-specific configuration file for Hera
[config]
## Project disk area
disk_project=epic
## Project hpss tape area
tape_project=emc-hwrf
## CPU account name for submitting jobs to the batch system.
cpu_account=epic
## Archive path
archive=disk:/scratch2/NAGAPE/epic/Jane.Doe

[dir]
## Save directory.  Make sure you edit this.
CDSAVE=/scratch2/NAGAPE/epic/save/Jane.Doe
## Non-scrubbed directory for track files, etc.  Make sure you edit this.
CDNOSCRUB=/scratch2/NAGAPE/epic/noscrub/Jane.Doe/hafstrak
## Scrubbed directory for large work files.  Make sure you edit this.
CDSCRUB=/scratch2/NAGAPE/epic/scrub/Jane.Doe
## Syndat directory for finding which cycles to run
syndat=/scratch1/NCEPDEV/hwrf/noscrub/input/SYNDAT-PLUS
COMOLD={oldcom}
COMIN={COMhafs}
COMOUT={COMhafs}
COMINnhc={ENV[DCOMROOT|-/dcom]}/nhc/atcf/ncep
COMINjtwc={ENV[DCOMROOT|-/dcom]}/{ENV[PDY]}/wtxtbul/storm_data
COMgfs=/scratch1/NCEPDEV/hwrf/noscrub/hafs-input/COMGFSv16
COMINobs={COMgfs}
COMINgfs={COMgfs}
COMINgdas={COMgfs}
COMINarch={COMgfs}/syndat
COMrtofs=/scratch1/NCEPDEV/hwrf/noscrub/hafs-input/COMRTOFSv2
COMINrtofs={COMrtofs}
COMINmsg={COMINgfs}
COMINhafs={COMINgfs}
DATMdir=/scratch1/NCEPDEV/{disk_project}/noscrub/{ENV[USER]}/DATM
DOCNdir=/scratch1/NCEPDEV/{disk_project}/noscrub/{ENV[USER]}/DOCN
## A-Deck directory for graphics
ADECKhafs=/scratch1/NCEPDEV/hwrf/noscrub/input/abdeck/aid
## B-Deck directory for graphics
BDECKhafs=/scratch1/NCEPDEV/hwrf/noscrub/input/abdeck/btk
## cartopyDataDir directory for graphics
cartopyDataDir=/scratch1/NCEPDEV/hwrf/noscrub/local/share/cartopy

2.2.4.2. HAFS Physics Configuration

Look in HAFS/parm/hafs.conf to determine what physics suites are running. For HAFS v2, the physics suites are:

ccpp_suite_regional=FV3_HAFS_v1_thompson_nonsst

ccpp_suite_glob=FV3_HAFS_v1_thompson_nonsst

ccpp_suite_nest=FV3_HAFS_v1_thompson_nonsst

To determine what physics schemes are included in the suites mentioned above, run:

more /path/to/HAFS/sorc/hafs_forecast.fd/FV3/ccpp/suites/suite_FV3_HAFS_v1_gfdlmp_tedmf_nonsst.xml

where path/to is replaced by the path to the HAFS clone.

2.2.4.3. HAFS Nesting Configuration

Two types of nesting configurations are available: (i) regional* and (ii) globnest.

Two namelist files (templates) for regional configuration are:
- HAFS/parm/forecast/regional/input.nml.tmp
- HAFS/parm/forecast/regional/input_nest.nml.tmp
One namelist file (template) for globnest configuration is:
- HAFS/parm/forecast/globnest/input.nml.tmp

An example namelist template file for HAFS (updated 07/26/2024) starts with:

&atmos_model_nml
  blocksize = @[blocksize]
  chksum_debug = .false.
  dycore_only = .false.
  avg_max_length = 10800.
  ccpp_suite = '@[ccpp_suite_nml]'
  ignore_rst_cksum = .true.
/

* operational implementation

2.2.4.4. XML File to Run the Workflow

Navigate to the rocoto directory and alter the workflow XML file as needed.

cd /path/to/HAFS/rocoto
vi hafs_workflow.xml.in

In HAFS/rocoto/hafs_workflow.xml.in the following can be modified to set the number of cycles and tasks.

<!ENTITY CYCLE THROTTLE “5”>: The number of cycles that can be activated at one time
<!ENTITY TASK_THROTTLE “120”>: The number of tasks that can be activated at one time
<!ENTITY MAX_TRIES “1”>: The maximum number of tries for all tasks

2.2.4.5. Edit the Cron Job Driver Script

Change the cron job driver script to set up the experiment and storm.

cd /path/to/HAFS/rocoto
vi cronjob_hafs_rt.sh

Make sure to uncomment #set -x and edit HOMEhafs as appropriate. For example:

#!/bin/sh
set -x
date

HOMEhafs=${HOMEhafs:-/scratch2/NAGAPE/epic/save/<username>/HAFS}
#HOMEhafs=${HOMEhafs:-/scratch2/NAGAPE/epic/noscrub/Gillian.Petro/save/HAFS}

Additionally, comment out any tests you do not want to run by placing a # in front of the lines that start with ./run_hafs.py.

2.2.4.6. Workflow Dependencies

For a non-ensemble forecast, Table %s describes the task dependencies.

Stage	Task	Description
1	Launch	Launch the workflow
2	input ???
2	atm_prep
2	atm_prep_mvnest
3	atm_ic	Prepare atmospheric initial conditions files
3	atm_ic_fgat##	Prepare atmospheric initial conditions FGAT files
3	atm_lbc#	Prepare atmospheric boundary conditions files
3	ocn_prep
3	wav_prep
4	atm_init
4	atm_init_fgat##
4	obs_prep
5	atm_merge ??? not always necessary?
5	atm_merge_fgat## ??? not always necessary?
5	atm_vi
5	atm_vi_fgat##
5	analysis
5 (when RUN_ATM_INIT==YES) or 6	analysis_merge
7 (possibly sooner depending on whether it is a simple of complex workflow with additional steps)	forecast
8	unpost
9	atm_post	Post-processing of atmospheric model output
9	ocn_post	Post-processing of ocean model output
9	wav_post	Post-processing of wave model output
10	product	Product generation?
10	atm_init_ens
11	output
11	gempak

2.2.4.7. Run HAFS and Check Progress

Load the Rocoto module and run the driver script in the rocoto directory to launch the experiment:

module load rocoto
./cronjob_hafs_rt.sh

To run through all tasks in the experiment, tasks need to be launched once their dependencies are satisfied. Users can launch tasks manually by running the rocotorun command regularly and repeatedly until all tasks are complete:

rocotorun -d hafs-<workflow_name>.db -w hafs-<workflow_name>.xml

where <workflow_name> is replaced with the full name of the database (.db) and Rocoto XML files. For example:

rocotorun -d hafs-HAFS_rt_regional_atm-13L-2020082512.db -w hafs-HAFS_rt_regional_atm-13L-2020082512.xml

Instead of running rocotorun manually, users can instead automate this task by adding it to a crontab on systems where cron is available:

crontab -e
*/5 * * * * cd /path/to/save/<username>/HAFS/rocoto && ./cronjob_hafs_rt.sh

For example, a user named Jane Doe might paste */5 * * * * cd /scratch2/NAGAPE/epic/save/Jane.Doe/HAFS/rocoto && ./cronjob_hafs_rt.sh into her crontab.

Note

On Orion, cron is only available on the orion-login-1 node.

To check experiment progress, users can run the rocotostat command with the same arguments as rocotorun described above:

rocotostat -d hafs-<workflow_name>.db -w hafs-<workflow_name>.xml

To check which specific tasks are in progress, users can run:

squeue -u <username>

If something goes wrong during the workflow, log files can be found in $CDSCRUB. The location of this directory is set in parm/system.conf. Specifically, log files will be located in $CDSCRUB/<reponame>/<cycle_date>/<storm_id> (e.g., $CDSCRUB/HAFS/2020082512/13L).

Note

Storm IDs are a number-letter combination indicating the storm number for a particular year and the storm basin it appeared in. For example, 13L is the 13th storm in the Atlantic basin for a particular year.

L = Atlantic basin

E = Eastern North Pacific basin

C = Central North Pacific basin

W = Western North Pacific basin

IO = North Indian Ocean

SH = Southern Hemisphere

2.2.4.8. Experiment Directory Structure

Several directories are generated over the course of a HAFS experiment. The directory structure diagram below summarizes how these directories are organized.

disk_project
  └── save
  |     └── Username
  |           └── HAFS
  |                 ├── parm
  |                 |     └── *.conf (configuration files)
  |                 ├── rocoto
  |                 |     ├── cronjob_*.sh
  |                 |     ├── hafs_rt_status.sh
  |                 |     ├── hafs-<repo>-##L-<cycledate>.db
  |                 |     ├── hafs-<repo>-##L-<cycledate>.xml
  |                 |     ├── hafs_workflow.xml.in
  |                 |     ├── rocoto_util.sh
  |                 |     └── run_hafs.py
  |                 └── sorc
  ├── scrub
  |     └── Username
  |           └── HAFS
  |                 ├── 2020082512
  |                 |     └── 13L
  |                 |           └── *.log (log files for each task)
  |                 ├── com
  |                 |     └── 2020082512
  |                 |           └── 13L
  |                 |                 ├── 13l.2020082512.*
  |                 |                 ├── storm1.conf
  |                 |                 └── storm1.*
  |                 └── log
  |                       ├── jlogfile
  |                       └── rocoto_<cycle>.log
  └── noscrub
        └── Username
              └── hafstrak