Distributed Coupling-Mode Decomposed-AQM Modeling System

Contents


Summary

The Distributed Coupling-Mode Decomposed-AQM Modeling System uses a cooperating system of "normal" air quality models on rectangular subdomains, together with the programs aqmmaster and metserver. This approach may be thought of as a special case of the nesting approach used by the real-time MAQSIP air quality model: each nest grid has a single air quality model working on just that grid; these AQMs cooperate as a coupled parallel system, with the coarser grids providing time-dependent boundary conditions to the finer grids nested within them, and with aggregated concentrations from the finer grids potentially being fed back to the coarser grids over the regions that the finer grids cover.

The modeling process begins by decomposing the domain into rectangular subdomains that overlap properly, and then putting the descriptions of these subdomains into an I/O API-standard GRIDDESC file. The metserver program reads the full-domain grid-geometry, emissions, and meteorology files, and from them constructs windowed grid-geometry, emissions, and meteorology files for each subdomain in the decomposition. At every advection step, the aqmmaster program assembles full-domain concentrations from the outputs of the subdomain air quality models and provides time-stepped boundary conditions back to them, as well as producing full-domain concentration outputs. The whole system is tied together with the coupling-mode extensions of the Models-3 I/O API to perform distributed parallel domain-decomposed air quality modeling across a set of machines.

Notice that all of the scheduling, coordination, assembly, and extraction activities are managed by the aqmmaster program, so that the subdomain air quality models are unmodified (except for linking with the coupling-mode version of the I/O API library and the PVM library, in addition to the usual netCDF library). The source code of the AQM is unaffected by this cooperating-process parallelism. No more work writing schedulers, boundary-extractors, etc., needs to be done by the modeler!

Both aqmmaster and metserver are Fortran-90 dynamically-sized programs that adapt at run time to the sets of met and chemistry variables being modeled, and to the grids being run. They are also basically independent of the AQM being run, as long as the AQM uses the Models-3 I/O API for input and output, uses the basic Models-3 scheme for meteorology file types, and avoids deadlocks, and as long as the gridded met files have all the variables necessary for windowing to produce the subdomain boundary met files.

Back to Contents


Domain Decomposition

Example of a Domain Decomposition

In this example, we begin with a grid having 18 rows and 24 columns. We decompose this domain into three subdomains, as listed below; for subdomain modeling purposes, each subdomain will be extended by a one-cell "halo" along each internal boundary.
  1. Subdomain 1: rows 1-18, columns 1-8
  2. Subdomain 2: rows 1-9, columns 9-24
  3. Subdomain 3: rows 10-18, columns 9-24

The reason for requiring this halo is that in all existing CMAQ and MAQSIP implementations there are errors in the implementation of thickened-boundary advection. If these errors were corrected, then the halos (and the computational overhead that goes with them) would no longer be necessary.

The portion of Subdomain 1 that is actually used to generate concentration field output is as described above; however, in order to preserve the full order accuracy of the horizontal advection numerics, the air quality model for Subdomain 1 actually models a region with a "halo" one column wider (and extending into Subdomains 2 and 3). The boundary for this subdomain then "lives" partly on column 10 of the original full domain. It is the responsibility of the aqmmaster program to gather rows 1-18, columns 1-8 from Subdomain 1 for the full-domain concentration output, and to provide column 10 as part of the (time-dependent) boundary values for Subdomain 1.

Similarly, the air quality model for Subdomain 2 actually models a 10-row by 17-column region with an L-shaped halo; the boundary for Subdomain 2 includes portions of column 7 and row 11, as illustrated below. Subdomain 3 is left as an exercise for the reader :-)
[Figure: Subdomain 2, showing its L-shaped halo and boundary]
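The decomposition itself is communicated to the programs through the I/O API-standard GRIDDESC file mentioned above (accessed, by I/O API convention, through the logical name GRIDDESC). As a purely illustrative sketch, the grid-description records for this decomposition might look like the following; the coordinate-system name 'LAM_HYPOTHETICAL', the grid names, and the origin and cell-size values are hypothetical placeholders (12-km cells, with the full-domain origin placed at (-144 km, -108 km)), the coordinate-system definition and segment delimiters are omitted, and the exact record layout should be checked against the I/O API GRIDDESC documentation:

    'FULL_24X18'            !  full domain:  24 columns, 18 rows
    'LAM_HYPOTHETICAL', -144000.0, -108000.0, 12000.0, 12000.0, 24, 18, 1
    'SUB1_18X8'             !  Subdomain 1:  rows 1-18, columns 1-8
    'LAM_HYPOTHETICAL', -144000.0, -108000.0, 12000.0, 12000.0,  8, 18, 1
    'SUB2_9X16'             !  Subdomain 2:  rows 1-9,   columns 9-24
    'LAM_HYPOTHETICAL',  -48000.0, -108000.0, 12000.0, 12000.0, 16,  9, 1
    'SUB3_9X16'             !  Subdomain 3:  rows 10-18, columns 9-24
    'LAM_HYPOTHETICAL',  -48000.0,       0.0, 12000.0, 12000.0, 16,  9, 1

Each grid record gives the coordinate-system name, XORIG, YORIG, XCELL, YCELL, NCOLS, NROWS, and NTHIK; whether the halo-extended subdomain grids appear as additional entries depends on how the subdomain AQMs themselves are configured.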

Back to Contents


Setting up Coupled-Mode Cooperative Modeling

The new Coupling Mode of the Models-3 I/O API was developed as part of the MCNC Practical Parallel Computing Strategies Project, which was partially funded by the National Center for Environmental Research and Quality Assurance, Office of Research and Development, U.S. Environmental Protection Agency, under the Science To Achieve Results Grants Program in high performance computing and communications.

The basic idea was that by changing the low-level data storage layer in the I/O API so that it had an alternative communications-based implementation in addition to the existing (netCDF-based) file-storage implementation, one could use existing single-topic models to build more complex cooperating-process, multi-topic coupled modeling systems, with the choice of storage mode made at program launch on the basis of environment variables. Moreover, the individual single-topic models would not "know" (nor would they need to know!) at the source code level whether they were running stand-alone or as part of a coupled modeling system. The only requirement for such coupled modeling was that input operations (I/O API OPEN3() for input files, READ3(), INTERP3(), XTRACT3(), and DDTVAR3()) should block when they request data that is not yet available (i.e., they should put the requester to sleep until the data's producer writes it out, and then wake up the requester and allow it to continue). This is made possible by the selective direct-access nature of I/O API calls, and in fact was one of the original design goals of the Models-3 system.
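As a concrete illustration of this launch-time choice (the logical name CHEM_CONC_3D and the path and channel names below are hypothetical placeholders), the same unmodified executable can be pointed either at a physical netCDF file or at a coupling-mode virtual file simply by changing one setenv in its run script:

    #  stand-alone run:  logical name bound to a physical netCDF file
    setenv CHEM_CONC_3D   /data/run01/CONC.full.ncf

    #  coupled run:  the same logical name bound to a PVM-mailbox virtual file
    setenv CHEM_CONC_3D   "virtual CONC_FULL"

The model's source code sees only the logical name CHEM_CONC_3D in either case.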

For the particular case of domain-decomposed air quality modeling, the way this works is described in the sections on aqmmaster and metserver below.

Back to Contents


Program aqmmaster

Description

The sequence of operations for aqmmaster is as follows:
  1. Read in all the control parameters (starting date, starting time, etc.), as given in the Environment Variables section below.
  2. Open the full-domain chemical initial and boundary condition files for input. Note that this determines both the full-domain grid structure and the set of chemical species that will be modeled.
  3. Create/open all the subdomain chemical initial and boundary condition files for output.
  4. Open the subdomain chemical concentration files for input.
  5. For every subdomain and for every chemical species:
    1. Window the full-domain concentrations to the subdomain grid.
    2. Write the subdomain concentration grid to the subdomain chemical initial condition file.
  6. For every time step:
    1. For every subdomain and for every chemical species:
      1. Read in the subdomain concentration grid from the subdomain concentration file.
      2. Aggregate the subdomain concentrations into the full-domain concentration grid.
      3. Construct the subdomain boundary concentrations.
      4. Write the subdomain boundary concentrations to the subdomain chemical boundary condition file.
    2. Write the assembled full-domain concentration grid to the full-domain concentration output file.
Notice that the order of operations is carefully laid out so as to avoid deadlocks, and so as to allow aqmmaster to act as a component in a cooperating-process implementation of a nested AQM. It is also laid out so as to allow coupled aqmmaster and metserver to operate within a cooperating-process real-time environmental modeling system with additional meteorological and emissions model components (avoiding race conditions in such a system, particularly, is the role of the optional SYNCH_FILE).
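As a purely illustrative sketch of how such a run might be configured (every logical name, path, and channel name below is a hypothetical placeholder, not necessarily a name that aqmmaster actually expects), the launch script binds the full-domain files to physical files and the per-subdomain files to coupling-mode virtual files:

    #  hypothetical aqmmaster launch sketch -- all names are placeholders
    setenv GRIDDESC        /data/run01/GRIDDESC.txt    #  full- and sub-domain grid definitions
    setenv CONC_INIT_FULL  /data/run01/ICON.full.ncf   #  full-domain initial conditions (physical)
    setenv CONC_BDY_FULL   /data/run01/BCON.full.ncf   #  full-domain boundary conditions (physical)
    setenv CONC_FULL       /data/run01/CONC.full.ncf   #  full-domain concentration output (physical)
    setenv CONC_INIT_1     "virtual ICON_SUB1"         #  subdomain 1 initial conditions (virtual)
    setenv CONC_BDY_1      "virtual BCON_SUB1"         #  subdomain 1 boundary conditions (virtual)
    setenv CONC_SUB_1      "virtual CONC_SUB1"         #  subdomain 1 concentrations (virtual)
    #  ...similarly for subdomains 2 and 3, plus the date/time control variables...
    aqmmaster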

Required Environment Variables

The execution of program aqmmaster is controlled purely by environment variables, for easy scriptability. Some of these variables are control parameters; others are logical-name environment variables for the input and output files, which contain the path names for the respective files, according to Models-3 conventions. These environment variables may be set by the csh setenv command, or by the sh or ksh export or env commands. The list of environment variables for aqmmaster is the following.

Back to Contents


Program metserver

Description

The sequence of operations for metserver is as follows:
  1. Read in all the control parameters (starting date, starting time, etc.), as given in the Environment Variables section below. Options include turning on or off each of the families of files below; note that the sets of variables in each input file determine, at run time, the sets of variables in the corresponding output subdomain files:
    1. CHEM_EMIS_3D
    2. GRID_BDY_2D
    3. GRID_BDY_3D
    4. GRID_CRO_2D
    5. GRID_CRO_3D
    6. GRID_DOT_2D
    7. MET_BDY_2D
    8. MET_BDY_3D
    9. MET_CRO_2D
    10. MET_CRO_3D
    10. MET_DOT_3D
    12. MET_KF_2D
    13. MET_KF_3D
  2. Perform consistency checks:
    1. Subdomain grids fit together correctly to form the full domain grid.
    2. If both are being produced, the time step for the CHEM_EMIS_3D file must be an exact multiple of, or exactly the same as, the met time step. If the met files are not being produced, set the met time step artificially to the emissions time step, to allow the deadlock-free interleaved processing algorithm described below.
    3. If GRID_BDY_2D is turned on, then GRID_CRO_2D must be available and must contain the needed variables;
    4. If GRID_BDY_3D is turned on, then GRID_CRO_3D must be available and must contain the needed variables;
    5. If MET_BDY_2D is turned on, then MET_CRO_2D must be available and must contain the needed variables;
    6. If MET_BDY_3D is turned on, then MET_CRO_3D must be available and must contain the needed variables;
  3. If the GRID_*_2D files are being produced, then for each variable within them:
    1. Read the grid and boundary values of the variable from the input files, as appropriate (from both, if it is a boundary variable; from the gridded file only, if it is a gridded-only variable);
    2. If the variable is a boundary variable, construct an "expanded domain" grid of that variable (including both the boundary and the cross-point-grid cells).
    3. For each subdomain:
      1. If the variable is a boundary variable, extract/construct the subdomain boundary values from the "expanded domain" grid and write them to the subdomain boundary file.
      2. Extract the subdomain cross-point gridded values from either the full-domain or the "expanded domain" grid (as appropriate), and write them to the subdomain cross-point gridded file.
  4. Similarly for the GRID_*_3D files.
  5. For each output met-file time step:
    1. If the MET_*_2D files are being produced, then for each variable within them:
      1. Read the grid and boundary values of the variable from the input files, as appropriate (from both, if it is a boundary variable; from the gridded file only, if it is a gridded-only variable);
      2. If the variable is a boundary variable, construct an "expanded domain" grid of that variable (including both the boundary and the cross-point-grid cells).
      3. For each subdomain:
        1. If the variable is a boundary variable, extract/construct the subdomain boundary values from the "expanded domain" grid and write them to the subdomain boundary file.
        2. Extract the subdomain cross-point gridded values from either the full-domain or the "expanded domain" grid (as appropriate), and write them to the subdomain cross-point gridded file.

      Note about KF Files: the MET_KF_* files are always physical files (not virtual) and are written by MM5 *before* the first write to any MET_CRO* file; in the AQM, they are read after several reads from the MET_CRO* files. Sandwiching MET_KF_* processing between MET_CRO_2D and MET_CRO_3D processing guarantees synchronization in coupling-mode operation. Note also that for the first time-step iteration, we must be careful to "capture" all events currently in progress.

    2. If the MET_KF_2D file is being produced, then window it and write the result to the subdomain files.
    3. Similarly for the MET_KF_3D files.
    4. Process the MET_CRO_3D and MET_BDY_3D files in the same fashion as the MET_CRO_2D and MET_BDY_2D files.
    5. Similarly for the MET_DOT_3D files.
    6. If the CHEM_EMIS_3D file is being produced, for each emissions variable:
      1. Read the gridded values of the variable from the input file.
      2. Extract the subdomain cross-point gridded values from the full-domain grid, and write them to the subdomain cross-point gridded file.
Notice that the order of operations is carefully interleaved so as to avoid deadlocks in a cooperating process environmental modeling system, and so as to allow metserver to act as a component in a cooperating-process implementation that includes concurrent meteorological and emissions models that generate the full-domain inputs to the distributed air quality model.
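As a purely illustrative sketch (every logical name, path, and channel name below is a hypothetical placeholder suggested by the file families listed above, not necessarily a name that metserver actually expects), a coupled run might bind the full-domain inputs to physical files and the subdomain outputs to virtual files:

    #  hypothetical metserver launch sketch -- all names are placeholders
    setenv GRIDDESC       /data/run01/GRIDDESC.txt
    setenv MET_CRO_2D     /data/met/METCRO2D.full.ncf   #  full-domain met inputs (physical)
    setenv MET_CRO_3D     /data/met/METCRO3D.full.ncf
    setenv MET_DOT_3D     /data/met/METDOT3D.full.ncf
    setenv CHEM_EMIS_3D   /data/emis/EMIS3D.full.ncf    #  full-domain emissions input (physical)
    setenv MET_CRO_2D_1   "virtual METCRO2D_SUB1"       #  subdomain 1 outputs (virtual)
    setenv MET_BDY_2D_1   "virtual METBDY2D_SUB1"
    setenv CHEM_EMIS_3D_1 "virtual EMIS3D_SUB1"
    #  ...similarly for the remaining file families and for subdomains 2 and 3...
    metserver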

Required Environment Variables

Execution of metserver is completely controlled by environment variables, for easy scriptability. These may be set by the csh setenv command, or by the sh or ksh export or env commands. The list of environment variables for metserver is the following.

Back to Contents


I/O API Coupling Mode

As part of the Practical Parallel Project, MCNC has developed an extended Model Coupling Mode for the I/O API. This mode, implemented using PVM 3.4 mailboxes, allows the user to specify in the run-script whether "file" means a physical file on disk or a PVM mailbox-based communications channel (a virtual file), on the basis of the value of the file's logical name:

    setenv FOO                "virtual BAR"
    setenv IOAPI_KEEP_NSTEPS  3
    
declares that FOO is the logical name of a virtual file whose physical name (in terms of PVM mailbox names) is BAR. The additional environment variable IOAPI_KEEP_NSTEPS determines the number of time steps to keep in the PVM mailbox buffers -- if it is 3 (as here), and there are already 3 time steps of variable QUX in the mailboxes for virtual file FOO, then writing a fourth time step of QUX to FOO causes the earliest time step of QUX to be erased, leaving only time steps 2, 3, and 4. This is necessary so that the coupled modeling system does not require an unbounded amount of memory for its sustained operation. If not set, IOAPI_KEEP_NSTEPS defaults to 2 (the minimum needed to support INTERP3()'s double-buffering).

The (UNIX) environments in which the modeler launches the multiple models that read or write a given virtual file must all agree on its physical name (usually achieved by having each run script source a common csh script that contains the relevant setenv commands).
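For example (the channel names below are hypothetical placeholders), the agreement can be kept in one place by putting the shared definitions into a single script that every run script sources before launching its model:

    #  coupling_setup.csh -- sourced by the metserver, aqmmaster, and subdomain-AQM run scripts
    setenv IOAPI_KEEP_NSTEPS  3
    setenv MET_CRO_2D_1       "virtual METCRO2D_SUB1"   #  same mailbox name in every process
    setenv CONC_SUB_1         "virtual CONC_SUB1"
    #  ...one setenv per shared virtual file...

Each run script then begins with "source coupling_setup.csh", so that the producer and all consumers of a given virtual file agree on its PVM mailbox name.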

For models exchanging data via virtual files of the I/O API's coupling mode, the I/O API schedules the various processes on the basis of data availability: a process that requests data not yet written to a virtual file is put to sleep until the producing process writes that data, and is then awakened and allowed to continue.

There are three requirements on the modeler:

Using coupling mode to construct complex modeling systems has several advantages from the model-engineering point of view:

Back to Contents


Previous: MCPL I/O API output module for MM5

Next: AIRS2M3 Program

Up: I/O API Related Programs

To: Models-3/EDSS I/O API: The Help Pages


Send comments to
Carlie J. Coats, Jr.
carlie@jyarborough.com