Start running GEM 4 and up - UQAM style


To run GEM one needs appropriate executables as well as a set of scripts and configuration files.
You will find some examples at the end of this page.

1) Set up SSM environment

Do this part only when running on Compute Canada clusters. On our UQAM servers this is already done at account creation.
You need to do part 1) only once!

Please, follow the instructions on the following wiki page:

     Set up the SSM environment


2) Set up the "GEM model environment"

a) Basic setup

Do this part only when running on Compute Canada clusters. On our UQAM servers this is already done at account creation.
You need to do part 2) only once!

On the UQAM servers this part has already been done during the creation of your account. But on Compute Canada clusters you have to execute point 1) to 4) on the following wiki page:

      https://wikiscta.scta.uqam.ca/wikiscta/index.php/Set_up_model_environment


b) Create model working directories

GEM needs a place to run and another to put its listings (log files) while the model is running. Both these places are hardcoded in the model to places under your home. But since your home only has a limited amount of space and the model needs lots of it, these two places need to be links to a place where you have more space and not directories.

First pick a place under which you want to put these model working directories.

Set the variable 'model_space' to the full path of the directory you just created just to be able to use it for the rest of the setup:

    model_space=...

Now create the following directories/links:
(Whenever possible use copy -> paste to avoid typos!!!)

    mkdir -p ${model_space}/EXECDIR ~/MODEL_EXEC_RUN
    ln -s ${model_space}/EXECDIR ~/MODEL_EXEC_RUN/${TRUE_HOST}

    mkdir -p ${model_space}/Listings ~/listings
    ln -s ${model_space}/Listings ~/listings/${TRUE_HOST}

After this you should have:

    ~/MODEL_EXEC_RUN/${TRUE_HOST} -> ${model_space}/EXECDIR
    ~/listings/${TRUE_HOST} -> ${model_space}/Listings

c) Create “storage_model"

When creating your executables the object files, extracted decks and executables will be put in a directory called ${storage_model}. A link will automatically be added to this place from the directory in which you create the executables.

You have to create this directory in a place in which you have space (therefore not under your home) and export the variable “storage_model“ set to that directory.

Export the variable “storage_model“ in your profiles, set to the '.../Storage_Model' directory, for example:

Add this export to these two profiles (if you do not have them, create them):

    ~/.profile.d/.interactive_profile
    ~/.profile.d/.batch_profile

and create the following symbolic link:

    ln -s ~/.profile.d/.batch_profile ~/.profile.d/.ssh_profile

d) Set allocation account name under which to run simulations

When submitting any job on Compute Canada clusters you always need to specify which allocation account should get "billed" for the resources (cores) your job is running on.
Every PI (Principle Investigator - in general your professor) has a certain amount of cores assigned to her/his group which can get used on average(!) throughout the year. They are called "core years". Every PI has a default allocation (def-...) of 50 core years and some PIs also have a research project allocation (rrg-...), which varies in size from one PI to the next and from one year to the next. To get an rrg allocation the PI has to fill out an application once a year, describing the projects for which CC resources will get used.
You can find all the allocation names under which you can run under your directory:

    ~/projects

Pick the one you want to run your simulations under and write its name into a file (which you have to create) called:

    ~/.Account

The above file must only contain the name of the allocation you want to run under and nothing else!

For example:

    $ cat ~/.Account
    def-laprise


3) Extra model scripts

You need to do part 3) only once as well - unless there is an update for the scripts.

The name of the personal scripts directory is not optional and depends on the GEM version you want to run.

For GEM 4.8.lts12 set:
    version='v_4.8.lts12.u'
For GEM 5.0 set:
    version='v_5.0-u1.rc1'
For GEM 5.1.1 set:
    version='v_5.1.1-u1'

Create a directory for all your extra or modified model scripts with:

    mkdir -p ~/modeles/GEMDM/${version}/bin

For now copy all scripts from my directory into yours, for example with:

    cp -p ~winger/modeles/GEMDM/${version}/bin_latest_version/* ~/modeles/GEMDM/${version}/bin

You are done now with the general model setup.

4) The configuration files

GEM needs four configuration files (short "config" files) to run:

When running GEM5 with CLASS/CLASSIC one extra file is needed, called:

Click on the link "configuration files" to learn what they are use for.

You need to create one "config" directory per simulation.
Since these config files are small and very important they should always be kept under the home directory. To make it easier to help each other I suggest you put them somewhere under:

For GEM 4.8.lts12 set:
    version='v_4.8.12'
For GEM 5.0 set:
    version='v_5.0.0'
For GEM 5.1.1 set:
    version='v_5.1.1'

    ~/gem/${version}/Configs/...

Also, to make it easier to find the config files of a simulation that is already running or even finished running, I suggest to set the basename of the "config" directory to the same name as the experiment name of the simulation. So something like:

    ~/gem/${version}/Configs/experiment1
or
    ~/gem/${version}/Configs/project1/experiment1

But please, chose better names than 'project1' or 'experiment1'!!!

The config directory needs to contain the four config files and will also be the directory from which the simulation will get launched.

You can find example config files you can copy into your config directory under:
    ~winger/gem/${version}/Configs/bcmk

Once you are ready to run your ""real" simulations, ask me (winger.katja@uqam.ca) or one of your colleges for a set of config files you can start from.


5) Set the model environment

You need to set the model environment in every window from which you want to run/launch the model or in which you want to create the model executable.

To have access to the model you need to set certain environment variables and make additions to the PATH.

You can do this by executing the following command:

To make our life easier, I created alias for everybody to use on all servers. So instead of the commands above all you have to type now is
    4812500511  respectively.
and your model environment is set.

Among others, these commands also sets the environment variables $gemdyn and $rpnphy which contain the directory name under which you can find all models dynamic recp. physics routines


6) Launching the model

  1. Go into your config directory, the directory in which you have your config files.
  2. Depending on the version you want to run set the model environment with (needs to be done only once per window):
        4812 / 500 / 511
  3. Launch the model with one of the following two commands:

    a)  Um_lance

    This is the basic way to submit the model. When running GEM in climate mode (UQAM style) and your simulation consists of more than one month the model will run one job per month. At the end of each month/job the next month will get submitted automatically.

    b)   Chunk_lance

    While Um_lance works well on our own UQAM servers it can cause big delays on Compute Canada clusters, because each job will be queued again. To avoid these extra queued times there is a 'Chunk_lance' will execute as many months as it can in the given wall time of the job (BACKEND_time_mod). At the end of each job the next job will get submitted automatically. While 'Chunk_lance' can be used for all types of simulations it is especially useful for longer simulations.

    When the model crashes and you want to resubmit it to continue the simulation, all you have to do is go back into you config file directory, set the model environment, and execute 'Chunk_lance' again.

    If ever you want to restart a simulation from the very beginning, you can do so by executing:

          Chunk_lance  -start

To check if your simulation is running use the command:
            qs

 You can also have a look at the example below.

7) What happens while the model is running

The model itself will run under the "execution directory" you created above, called:
    ~/MODEL_EXEC_RUN/machine_name/experiment_name

Each simulation consists of several parts which will all get started/executed/submitted automatically after the initial launch with Um_lance or Chunk_lance.

First some "preparation scripts" will get executed, preparing the execution directory as well as updating your gem_settings.nml and outcfg.out. Then the main "model" will get executed. At the end of each month, and upon request (Out3_close_interval_S) even during a month, the model will submit the "post processing". The post processing will collect and combine and archive all the direct model output, calculate monthly means and variances (if CLIMAT_diagnos=1) and archive the restart files. At the end, the "save listings" job will collect all the listings and put them in your archive (CLIMAT_archdir).

GEM model flow diagram

Note: The model does not depend on post processing job. Even if one of these jobs aborts, the model will continue running.


Listings

While the model is running the listings will appear under the directory:
            ~/listings/${TRUE_HOST}
They will all have ${GEM_exp} in the name.

Once a job/month finished properly they will all get archived under:
            ${CLIMAT_archdir}/Listings/listings_${GEM_exp}.zip


Model output

While the model is running the output will be under:
            ~/MODEL_EXEC_RUN/${TRUE_HOST}/${GEM_exp}/RUNMOD/output/cfg_0000/laststep_...

Whenever the model or the post processing crashes you should find the output produced so far under the above directories.

The post processing collects the model output and archives it under:
            ${CLIMAT_archdir}

Once the post processing is finished you should have the following directories in the above directory":

Samples Raw n-hourly model output
Diagnostics Monthly means and variances calculated by the post processing (only if CLIMAT_diagnos=1 )
Listings Model listings, jobs, and config files
Restarts Restart files which can be used to restart a simulation from given check points. Once a simulation is finished remove several of these files to only keep 1 or 2 per year, since they are fairly large.
Analysis Initial conditions files which can be used to initialize another simulations (only if CLIMAT_out_anal is set)
Pilots Pilot files which can be used to drive another simulation (only if CLIMAT_out_pilot is set)

And then you should also have this file:
    maingemdm_Linux_x86-64.Abs  
which is a copy of the executable.


There are four types of output files:

        dp...  : dynamics on pressure level
        dm... : dynamics on model levels
        pm... : physics on model levels
        pp...  : physics on pressure levels


8) What to do with the model output

There are several ways to look at the model output, which is in the RPN standard file format, and to modify it.
Have a look at this web page for some examples.

9) Create your own executable

For versions 4.8.12 and 5.0.0 click here!

For version  5.1.1 click here!


Example 1: Run a LAM simulation with GEM5.0.0

1) Config files

First you need to create a configuration directory.

For this example create and enter the following directory:

mkdir -p ~/gem/v_5.0.0/Configs/Examples/LAM_60x60
cd ~/gem/v_5.0.0/Configs/Examples/LAM_60x60

Then copy the config files from the example directory into you config directory:

cp ~winger/gem/v_5.0.0/Configs/bcmk/LAM_60x60/* .

2) Submit simulation

Make sure you model environment is set (with command: 500).

Then submit the simulation with:

Um_lance

3) Check simulation

You can check if you are running with:

qs

If ever you need to kill a simulation you can do that with:

    qdel JobID

Click here to see what happens while the model is running.




Author: Katja Winger
Last update: January 2021