LGMLSep 12, 2017

High-Dimensional Dependency Structure Learning for Physical Processes

arXiv:1709.03891v16 citations
Originality Incremental advance
AI Analysis

This work addresses the problem of dependency structure learning for high-dimensional physical data, offering a more stable and efficient method for researchers in geoscience and related fields, though it appears incremental as it builds on existing sparse regression approaches.

The paper tackles the challenge of learning dependency structures in high-dimensional physical processes, such as atmospheric phenomena, by introducing ACLIME-ADMM, an efficient two-step algorithm that adaptively estimates edge-specific parameters and learns structures using ADMM. On real 50-year geopotential height data, it recovers atmospheric circulation patterns, including wind direction switches at the equator and tropics.

In this paper, we consider the use of structure learning methods for probabilistic graphical models to identify statistical dependencies in high-dimensional physical processes. Such processes are often synthetically characterized using PDEs (partial differential equations) and are observed in a variety of natural phenomena, including geoscience data capturing atmospheric and hydrological phenomena. Classical structure learning approaches such as the PC algorithm and variants are challenging to apply due to their high computational and sample requirements. Modern approaches, often based on sparse regression and variants, do come with finite sample guarantees, but are usually highly sensitive to the choice of hyper-parameters, e.g., parameter $λ$ for sparsity inducing constraint or regularization. In this paper, we present ACLIME-ADMM, an efficient two-step algorithm for adaptive structure learning, which estimates an edge specific parameter $λ_{ij}$ in the first step, and uses these parameters to learn the structure in the second step. Both steps of our algorithm use (inexact) ADMM to solve suitable linear programs, and all iterations can be done in closed form in an efficient block parallel manner. We compare ACLIME-ADMM with baselines on both synthetic data simulated by partial differential equations (PDEs) that model advection-diffusion processes, and real data (50 years) of daily global geopotential heights to study information flow in the atmosphere. ACLIME-ADMM is shown to be efficient, stable, and competitive, usually better than the baselines especially on difficult problems. On real data, ACLIME-ADMM recovers the underlying structure of global atmospheric circulation, including switches in wind directions at the equator and tropics entirely from the data.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes