CISL Seminar Series: To Petabytes...and Beyond! Data Management with CESM

02/28/2018 - 10:00am to 11:00am
Mesa Lab (ML), Main Seminar Room

CISL Seminar Series

To Petabytes...and Beyond!
Data Management with CESM

Gary Strand
NCAR, CGD

The Community Earth System Model (CESM) is NCAR’s flagship, state-of-the-art climate model. It is used to simulate the Earth’s climate system, from the distant past into the future, and to investigate the processes underlying it. Depending on the model configuration, CESM simulations can span time periods from as short as a few days to tens of thousands of years.

In the last few years, large ensembles of CESM simulations have become more common, in which an identical model configuration is run dozens to thousands of times. As a result, a single project can generate large volumes of model data, from tens of terabytes to over a petabyte. Managing and analyzing the output from these petascale projects can be a daunting task.

The talk will cover the past, present, and future of the data engineering and management of CESM data. I'll focus on the tools that I use to handle the scale and complexity of these data, and their application to some recent CESM petascale projects, such as LENS, LME, DPLE, and GLENS. I'll also discuss the upcoming set of simulations for CMIP6 and the future directions of large-scale data engineering and management within CESM. Finally, I'll share my views on best practices in CESM data management and on the policies that guide and influence it, now and in the future.

Biography

Gary Strand is a software engineer in the Climate Change Prediction group of the Climate and Global Dynamics Laboratory (CGD) of NCAR. He began work at NCAR as a student assistant and has been involved in several generations of climate model development in CGD. He is the primary data manager and data scientist for the NCAR climate model, the Community Earth System Model (CESM). He has led major data management activities and projects for CESM since 2003, including CMIP3, CMIP5, and other large-scale CESM projects.


Wednesday, February 28, 2018
10:00 a.m. - 11:00 a.m.
Mesa Lab, Main Seminar Room