REDDA: Reduced subspace in big data treatment: A new paradigm for efficient geophysical Data Assimilation

Big Data methods in geosciences

Objectives

The objective of REDDA is to contribute to the development of novel Big Data methods that can process very large datasets efficiently while extracting as much information as possible. The project is structured along two research lines (RL). RL1 will deliver reduced-order Bayesian methods that are sufficiently accurate and efficient for nonlinear, high-dimensional geophysical systems. RL2 will develop a data assimilation method for the new sea ice model (neXtSIM) developed at NERSC. The neXtSIM model is Lagrangian and will assimilate Lagrangian observations of sea ice motion. In the long term, the fully Lagrangian sea ice data assimilation system will be included in both short-term Arctic environment monitoring (TOPAZ in Copernicus) and decadal climate prediction systems (NorCPM), both of which already use a Monte Carlo data assimilation framework. REDDA aims to produce methods that are - by construction - generic enough for future inclusion in complex coupled Earth System Models.

Project Summary

Environmental science has been a primary and challenging test ground for Data Assimilation (DA). The huge dimension of the numerical models of the climate system, the vast amount of Earth observational data at our disposal, and the pressure to deliver timely and accurate forecasts have motivated extraordinary research activity. This has led to enormous advances that have subsequently spread to other domains of science. At the same time, geophysical DA is an exemplar of a Big Data problem: models have dimension O(10^9) and the observational datasets O(10^8). Computationally efficient state estimation and uncertainty quantification must be carried out using massive datasets and huge dynamical models. Increasing computational power alone will not solve the issue, since the problem complexity grows commensurately with both the data volume and the model size, making continuous development of advanced DA procedures necessary. REDDA’s aim is to contribute to the development of novel Big Data methods that can process very large datasets efficiently while extracting as much information as possible. REDDA is an interdisciplinary project between geoscientists and mathematicians with two research lines (RL) that have their origin in climate science but will be investigated from a mathematical perspective:

RL1. Reduced order fully Bayesian DA methods for nonlinear systems

RL2. DA methods for Lagrangian sea-ice models
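
To give a sense of scale for the model dimension quoted above, the following back-of-the-envelope sketch compares the memory needed to store a full error covariance matrix with the memory needed for an ensemble (Monte Carlo) representation. It is only an illustration and not part of the REDDA methods: the 8-byte double-precision storage and the ensemble size of 100 members are assumptions chosen for the example, and the function names are hypothetical.

```python
# Rough storage estimate for a state of dimension O(10^9).
# Assumptions (not from the project description): values are stored as
# 8-byte floats, and the ensemble has 100 members.

BYTES_PER_FLOAT64 = 8


def full_covariance_bytes(state_dim: int) -> int:
    """Bytes needed for a dense state_dim x state_dim covariance matrix."""
    return state_dim * state_dim * BYTES_PER_FLOAT64


def ensemble_bytes(state_dim: int, ensemble_size: int) -> int:
    """Bytes needed for ensemble_size state vectors of length state_dim."""
    return state_dim * ensemble_size * BYTES_PER_FLOAT64


STATE_DIM = 10**9      # model dimension quoted in the project summary
ENSEMBLE_SIZE = 100    # hypothetical ensemble size, for illustration only

full = full_covariance_bytes(STATE_DIM)
reduced = ensemble_bytes(STATE_DIM, ENSEMBLE_SIZE)

print(f"full covariance: {full / 1e18:.1f} EB")   # exabytes (10^18 bytes)
print(f"ensemble:        {reduced / 1e12:.1f} TB")  # terabytes (10^12 bytes)
```

Under these assumptions the dense covariance would need about 8 exabytes, whereas the ensemble needs under a terabyte. A gap of this size is one reason why working in a reduced subspace is attractive for systems this large.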

REDDA employs two postdoctoral scientists:

  1. Postdoc 1 - Colin Grudzien on RL1
  2. Postdoc 2 - Position on RL2, to be opened soon

Peer-Reviewed Publications

  1. Bocquet M, Carrassi A. Four-dimensional ensemble variational data assimilation and the unstable subspace. Tellus A: Dynamic Meteorology and Oceanography. 2017;69(1).
  2. Brajard J, Carrassi A, Bocquet M, Bertino L. Combining data assimilation and machine learning to infer unresolved scale parametrization. Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences. 2021;379(2194).

Project Details

Acronym: REDDA
Funding Agency: Research Council of Norway
Coordinating Institute: Nansen Environmental and Remote Sensing Center
Project Status: Completed