The CESM team has developed a post-processing diagnostics tool using Python. This page describes how to use it on chemistry output on the NCAR HPC, Cheyenne. At least a full year of data is needed, with two months before and two months after that year. For example, to process 2004 you need output from November 1, 2003 to March 1, 2005. More information is at: https://github.com/NCAR/CESM_postprocessing/wiki, and the Quick Start Guide is here: https://github.com/NCAR/CESM_postprocessing/wiki/cheyenne-and-geyser-quick-start-guide
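The "two months either side" requirement can be sketched as a small helper. This is purely illustrative (the function name and code are not part of the CESM tool); it just computes the data window needed for a given analysis year:

```python
from datetime import date

def required_data_window(year):
    """Return the (start, end) dates of model output needed to
    post-process a given analysis year: the tool wants two months
    of data before and two months after the full year.
    Illustrative helper only, not part of CESM_postprocessing."""
    start = date(year - 1, 11, 1)  # two months before Jan 1 of the analysis year
    end = date(year + 1, 3, 1)     # two months after Dec 31 of the analysis year
    return start, end

start, end = required_data_window(2004)
print(start, end)  # 2003-11-01 2005-03-01
```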
This automated diagnostic tool is useful for understanding overarching features of the simulation by comparing with a default set of observations and/or a previous simulation. It can also be used to produce timeseries (see Section 5 below) for long simulations, reducing the space taken by output files by compressing the data; this is highly recommended by the CAM-chem team. Also note: only keep output that you really need for your science!
> cd <case_dir>
> cesm_pp_activate    (opens the virtual environment)
[(NPL) ] > create_postprocess --caseroot <case_dir>
[(NPL) ] > deactivate    (closes the virtual environment)
> cd /glade/scratch/<user>/post_processing/
> cesm_pp_activate    (opens the virtual environment)
[(NPL) ] > create_postprocess --caseroot /glade/scratch/<user>/post_processing/<model-run>
[(NPL) ] > deactivate    (closes the virtual environment)
If you get the SUCCESS! notification, the <model-run> folder has been created in the post_processing location and analysis code has been added. Note: the <model-run> has to be the same name as your run folder.
Within the 'postprocess' directory (that was created in step #2), edit the scripts.
> ls *xml
Either use pp_config (like xmlchange) or edit the following files directly in an editor.
env_postprocess.xml
If post-processing is occurring somewhere other than in <case_dir>, set the location of the model data:
> ./pp_config --set DOUT_S_ROOT=<full archive path of model run output to be analyzed>
Example:
> ./pp_config --set DOUT_S_ROOT=/gpfs/fs1/scratch/<user>/archive/<model-run>
Note: do not add slashes to the end of the path.
Tell the diagnostics what kind of grid to expect. For example, for the 0.9x1.25 degree resolution:
> ./pp_config --set ATM_GRID=0.9x1.25
> ./pp_config --set ICE_NX=288
> ./pp_config --set ICE_NY=192
> ./pp_config --set LND_GRID=0.9x1.25
Other changes:
<entry id="GENERATE_TIMESERIES" value="FALSE" />
Set this to "TRUE" if you want to generate timeseries (for longer runs).
env_diags_atm.xml
Set up to compare with another model run.
<entry id="ATMDIAG_MODEL_VS_OBS" value="False" />
<entry id="ATMDIAG_MODEL_VS_MODEL" value="True" />
<entry id="ATMDIAG_CLEANUP_FILES" value="True" />
Test dataset (the run you want to analyze)
<entry id="ATMDIAG_test_compute_climo" value="True" />
<entry id="ATMDIAG_test_compute_zonalAvg" value="True" />
Control dataset (the run you want to compare with)
<entry id="ATMDIAG_cntl_casename" value="<cntr_case_name>" />
<entry id="ATMDIAG_cntl_path_history" value="<path-to-comparison-output-on-archive>" />
<entry id="ATMDIAG_cntl_compute_climo" value="True" />
<entry id="ATMDIAG_cntl_compute_zonalAvg" value="True" />
Time period of analysis for the test and control cases: minimum 1 year, and you need output for 2 months either side of the full year to be analyzed.
<entry id="ATMDIAG_test_first_yr" value="2014" />
<entry id="ATMDIAG_test_nyrs" value="1" />
<entry id="ATMDIAG_cntl_first_yr" value="2014" />
<entry id="ATMDIAG_cntl_nyrs" value="1" />
Other diagnostic variables to set
<entry id="ATMDIAG_strip_off_vars" value="False" />
<entry id="ATMDIAG_netcdf_format" value="netcdfLarge" />
Diagnostic sets
<entry id="ATMDIAG_all_chem_sets" value="False" />
Then set the chem sets to True manually, except for chem set #6 (this one takes a long time).
Note 1: Chemistry diagnostic set 2 (Cset2) will only be calculated when performing a model-model comparison.
Note 2: To ensure all seasons are calculated, make sure...
In the atm_averages and atm_diagnostics files, make sure the #PBS account flag is set:
#PBS -A <account_number>
> qsub atm_averages
This calculates the climatological values for the test and control cases (~40 mins for 5 years); check the log files in the logs folder.
Find climo files in: $DOUT_S_ROOT/atm/proc/climo/$ATMDIAG_test_casename/ and: $DOUT_S_ROOT/atm/proc/climo/$ATMDIAG_cntl_casename/
> qsub atm_diagnostics
If the instructions above are followed, this step calculates model-versus-model values from the climo data created in step 4a) and creates diagnostic output (~10 mins for 5 years); check the log files in the logs folder.
Find diagnostic files in: $DOUT_S_ROOT/atm/proc/diag/$ATMDIAG_test_casename-$ATMDIAG_cntl_casename
To visualize the output, open index.html in a web browser.
Timeseries production with this tool can be completed for selected or all output streams (e.g. atm, ocn). The user can specify whether to write out one large timeseries for the entire run (for example a full 100 years) or instead produce files containing smaller time-chunks (e.g. of 10 or 20 years). A time-chunk length (even 100 years) always has to be defined.
env_postprocess.xml
To generate any timeseries:
<entry id="GENERATE_TIMESERIES" value="TRUE" />
Then, determine whether to allow partial time-chunks to be produced, or only full timeseries chunks.
<entry id="TIMESERIES_COMPLETECHUNK" value="FALSE" />
If this is set to "TRUE", years that are not contained in a complete time-chunk will not be processed. Setting this to "FALSE" therefore makes sure all your output gets processed.
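How chunking interacts with TIMESERIES_COMPLETECHUNK can be sketched in a few lines of Python. This is an illustrative model of the behavior described above, not the tool's actual code, and the function name is invented:

```python
def split_into_chunks(first_year, last_year, chunk_len, complete_chunks_only):
    """Group run years into timeseries chunks of chunk_len years.
    With complete_chunks_only=True, a trailing partial chunk is dropped,
    mirroring TIMESERIES_COMPLETECHUNK=TRUE (those years go unprocessed).
    Illustrative sketch only, not CESM_postprocessing code."""
    years = list(range(first_year, last_year + 1))
    chunks = [years[i:i + chunk_len] for i in range(0, len(years), chunk_len)]
    if complete_chunks_only and chunks and len(chunks[-1]) < chunk_len:
        chunks.pop()  # the trailing partial chunk is not produced
    return chunks

# A 25-year run (2000-2024) in 10-year chunks:
print(split_into_chunks(2000, 2024, 10, complete_chunks_only=False))
# three chunks; the last holds only the 5 years 2020-2024
print(split_into_chunks(2000, 2024, 10, complete_chunks_only=True))
# two chunks; 2020-2024 would not be processed
```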
The default setting is that all timeseries are generated. To allow user control over this, set:
<entry id="TIMESERIES_GENERATE_ALL" value="FALSE" />
env_timeseries.xml
Turn on the output streams you wish to process. For example, to process the atmosphere output stream:
<comp_archive_spec name="cam">
  <rootdir>atm</rootdir>
  <multi_instance>True</multi_instance>
To not process this output stream, you have to set:
<multi_instance>False</multi_instance>
Next, you have to define the length of each timeseries chunk for all the output streams you wish to process. For example, processing the atmosphere monthly output in 10-year time-chunks looks like:
<file_extension suffix=".h0.[0-9]">
  <subdir>hist</subdir>
  <tseries_create>TRUE</tseries_create>
  <tseries_output_format>netcdf4c</tseries_output_format>
  <tseries_tper>month_1</tseries_tper>
  <tseries_filecat_tper>years</tseries_filecat_tper>
  <tseries_filecat_n>10</tseries_filecat_n>
</file_extension>
> qsub timeseries
As for the other scripts, you need to make sure your project number is correctly set (#PBS -A <account_number>). If you have a lot of output, you may have to increase your PE layout with more cores to finish. Usually, atm monthly data are processed very fast. Daily and sub-daily output can take a bit longer depending on the variables you are using.