4.2.Data Repositories for CCSM data

CCSM control and simulation integrations are being carried out. No one
center can support all of this CCSM data. Because a significant number of
CCSM simulations will be produced at sites outside of NCAR's SCD, guidelines
for data storage must be formulated. Consideration must be given to where
and raw and post-processed data are stored permanently and under what
conditions data should be moved to other sites.
At the present time all history data being generated at NERSC is being
stored at NERSC and is being moved back to NCAR. While necessary in the
final development stages of CCSM2, it is not a desirable situation to have
the full set of first generation history data mirrored at two sites. A plan
should be developed which has a priority of archiving first generation data
at a single site.
Recommendation:
The CDMG should develop a formal policy outlining how CCSM data assets are
to be managed.
Data produced by the CCSM are to be stored, managed and distributed by the
data archive center associated with the site that carried out the
integration. When accessing the data, the details of the locations of
these distributed data should be transparent to the user.
Possible Supporting Policy:
The primary repository for any CCSM data will be the data center appointed
by the entity sponsoring the CCSM run that produces the data. CCSM data
created at NCAR under NSF support will be archived on the NCAR Mass
Storage System (MSS). CCSM data generated under external sponsorship at
non-NCAR facilities are expected to serve data created at those sites, if
appropriate, or arrangements made to serve this data from one of the core
CCSM data repository sites. CCSM data created at non-NCAR sites can be
archived from the NCAR MSS if prior arrangements have been made with NCAR
management.
+^

  • No labels