
Community-enabled Data-Model Integration Infrastructure (CD-MII)


EMSL Project ID
60153

Abstract

The scientific questions addressed by EMSL and BER researchers are inherently multiscale and multidisciplinary, so answering them requires extensive integration of models and data representing a variety of physical, chemical, and biological processes across a wide range of spatial and temporal scales. Numerous computational and data resources exist, but their broader use by researchers is hindered by the difficulty of linking models and data across scales, limited cross-platform access to data repositories, inadequate reproducibility of scientific workflows, and the specialized expertise needed to maintain and improve existing workflows or coupled model frameworks.

The proposed Community-enabled Data-Model Integration Infrastructure (CD-MII) will address these challenges with the specific goal of enabling the integration of numerical models and experimental data within a multiscale ModEx paradigm to support the EMSL and BER scientific communities. Specifically, CD-MII efforts will: (1) give EMSL/BER users access to a BER-focused suite of available software tools, including performance optimization tools and user support; and (2) emphasize the development of generalizable scientific workflows that facilitate model coupling and data integration while allowing for future expansion. CD-MII will build on the existing event-driven data analysis and simulation system, which resides centrally on the EMSL supercomputer and was founded on the Pacifica engine that underlies the MyEMSL data repository. As a result, standardized workflows will allow CD-MII projects to respond to new data and simulation inputs as they become available. Additionally, CD-MII infrastructure capabilities will be tested and demonstrated through selected use cases that directly serve existing EMSL user projects or BER-funded research.

CD-MII’s community-focused design and its ability to accommodate current and future data-model integration needs will provide a critical resource to the EMSL/BER science community as it addresses challenging multiscale environmental and biological research problems.

Project Details

Start Date
2021-06-21
End Date
2023-09-30
Status
Closed

Team

Principal Investigator

Timothy Scheibe
Institution
Pacific Northwest National Laboratory

Team Members

Balwinder Singh
Institution
Pacific Northwest National Laboratory

Scott Waichler
Institution
Pacific Northwest National Laboratory

Steven Yabusaki
Institution
Pacific Northwest National Laboratory