LGJul 19, 2023
PreDiff: Precipitation Nowcasting with Latent Diffusion ModelsZhihan Gao, Xingjian Shi, Boran Han et al. · amazon-science
Earth system forecasting has traditionally relied on complex physical models that are computationally expensive and require significant domain expertise. In the past decade, the unprecedented increase in spatiotemporal Earth observation data has enabled data-driven forecasting models using deep learning techniques. These models have shown promise for diverse Earth system forecasting tasks but either struggle with handling uncertainty or neglect domain-specific prior knowledge, resulting in averaging possible futures to blurred forecasts or generating physically implausible predictions. To address these limitations, we propose a two-stage pipeline for probabilistic spatiotemporal forecasting: 1) We develop PreDiff, a conditional latent diffusion model capable of probabilistic forecasts. 2) We incorporate an explicit knowledge alignment mechanism to align forecasts with domain-specific physical constraints. This is achieved by estimating the deviation from imposed constraints at each denoising step and adjusting the transition distribution accordingly. We conduct empirical studies on two datasets: N-body MNIST, a synthetic dataset with chaotic behavior, and SEVIR, a real-world precipitation nowcasting dataset. Specifically, we impose the law of conservation of energy in N-body MNIST and anticipated precipitation intensity in SEVIR. Experiments demonstrate the effectiveness of PreDiff in handling uncertainty, incorporating domain-specific prior knowledge, and generating forecasts that exhibit high operational utility.
FLU-DYNAug 21, 2024Code
DrivAerML: High-Fidelity Computational Fluid Dynamics Dataset for Road-Car External AerodynamicsNeil Ashton, Charles Mockett, Marian Fuchs et al.
Machine Learning (ML) has the potential to revolutionise the field of automotive aerodynamics, enabling split-second flow predictions early in the design process. However, the lack of open-source training data for realistic road cars, using high-fidelity CFD methods, represents a barrier to their development. To address this, a high-fidelity open-source (CC-BY-SA) public dataset for automotive aerodynamics has been generated, based on 500 parametrically morphed variants of the widely-used DrivAer notchback generic vehicle. Mesh generation and scale-resolving CFD was executed using consistent and validated automatic workflows representative of the industrial state-of-the-art. Geometries and rich aerodynamic data are published in open-source formats. To our knowledge, this is the first large, public-domain dataset for complex automotive configurations generated using high-fidelity CFD.
NAFeb 13, 2018
Numerical Artifacts in the Generalized Porous Medium Equation: Why Harmonic Averaging Itself Is Not to BlameDanielle Maddix, Luiz Sampaio, Margot Gerritsen
The degenerate parabolic Generalized Porous Medium Equation (GPME) poses numerical challenges due to self-sharpening and its sharp corner solutions. For these problems, we show results for two subclasses of the GPME with differentiable $k(p)$ with respect to $p$, namely the Porous Medium Equation (PME) and the superslow diffusion equation. Spurious temporal oscillations, and nonphysical locking and lagging have been reported in the literature. These issues have been attributed to harmonic averaging of the coefficient $k(p)$ for small $p$, and arithmetic averaging has been suggested as an alternative. We show that harmonic averaging is not solely responsible and that an improved discretization can mitigate these issues. Here, we investigate the causes of these numerical artifacts using modified equation analysis. The modified equation framework can be used for any type of discretization. We show results for the second order finite volume method. The observed problems with harmonic averaging can be traced to two leading error terms in its modified equation. This is also illustrated numerically through a Modified Harmonic Method (MHM) that can locally modify the critical terms to remove the aforementioned numerical artifacts.
NAApr 27, 2018
Numerical Artifacts in the Discontinuous Generalized Porous Medium Equation: How to Avoid Spurious Temporal OscillationsDanielle Maddix, Luiz Sampaio, Margot Gerritsen
Numerical discretizations of the Generalized Porous Medium Equation (GPME) with discontinuous coefficients are analyzed with respect to the formation of numerical artifacts. In addition to the degeneracy and self-sharpening of the GPME with continuous coefficients, detailed in [1], increased numerical challenges occur in the discontinuous coefficients case. These numerical challenges manifest themselves in spurious temporal oscillations in second order finite volume discretizations with both arithmetic and harmonic averaging. The integral average, developed in [2] leads to improved solutions with monotone and reduced amplitude temporal oscillations. In this paper, we propose a new method called the Shock-Based Averaging Method (SAM) that incorporates the shock position into the numerical scheme. The shock position is numerically calculated by discretizing the theoretical speed of the front from the GPME theory. The speed satisfies the jump condition for integral conservation laws. SAM results in a non-oscillatory temporal profile, producing physically valid numerical results. We use SAM to demonstrate that the choice of averaging alone is not the cause of the oscillations, and that the shock position must be a part of the numerical scheme to avoid the artifacts.
LGDec 18, 2021
GOPHER: Categorical probabilistic forecasting with graph structure via local continuous-time dynamicsKe Alexander Wang, Danielle Maddix, Yuyang Wang
We consider the problem of probabilistic forecasting over categories with graph structure, where the dynamics at a vertex depends on its local connectivity structure. We present GOPHER, a method that combines the inductive bias of graph neural networks with neural ODEs to capture the intrinsic local continuous-time dynamics of our probabilistic forecasts. We study the benefits of these two inductive biases by comparing against baseline models that help disentangle the benefits of each. We find that capturing the graph structure is crucial for accurate in-domain probabilistic predictions and more sample efficient models. Surprisingly, our experiments demonstrate that the continuous time evolution inductive bias brings little to no benefit despite reflecting the true probability dynamics.
LGNov 12, 2021
Learning Quantile Functions without Quantile Crossing for Distribution-free Time Series ForecastingYoungsuk Park, Danielle Maddix, François-Xavier Aubet et al.
Quantile regression is an effective technique to quantify uncertainty, fit challenging underlying distributions, and often provide full probabilistic predictions through joint learnings over multiple quantile levels. A common drawback of these joint quantile regressions, however, is \textit{quantile crossing}, which violates the desirable monotone property of the conditional quantile function. In this work, we propose the Incremental (Spline) Quantile Functions I(S)QF, a flexible and efficient distribution-free quantile estimation framework that resolves quantile crossing with a simple neural network layer. Moreover, I(S)QF inter/extrapolate to predict arbitrary quantile levels that differ from the underlying training ones. Equipped with the analytical evaluation of the continuous ranked probability score of I(S)QF representations, we apply our methods to NN-based times series forecasting cases, where the savings of the expensive re-training costs for non-trained quantile levels is particularly significant. We also provide a generalization error analysis of our proposed approaches under the sequence-to-sequence setting. Lastly, extensive experiments demonstrate the improvement of consistency and accuracy errors over other baselines.
LGNov 20, 2020
Bridging Physics-based and Data-driven modeling for Learning Dynamical SystemsRui Wang, Danielle Maddix, Christos Faloutsos et al.
How can we learn a dynamical system to make forecasts, when some variables are unobserved? For instance, in COVID-19, we want to forecast the number of infected and death cases but we do not know the count of susceptible and exposed people. While mechanics compartment models are widely used in epidemic modeling, data-driven models are emerging for disease forecasting. We first formalize the learning of physics-based models as AutoODE, which leverages automatic differentiation to estimate the model parameters. Through a benchmark study on COVID-19 forecasting, we notice that physics-based mechanistic models significantly outperform deep learning. Our method obtains a 57.4% reduction in mean absolute errors for 7-day ahead COVID-19 forecasting compared with the best deep learning competitor. Such performance differences highlight the generalization problem in dynamical system learning due to distribution shift. We identify two scenarios where distribution shift can occur: changes in data domain and changes in parameter domain (system dynamics). Through systematic experiments on several dynamical systems, we found that deep learning models fail to forecast well under both scenarios. While much research on distribution shift has focused on changes in the data domain, our work calls attention to rethink generalization for learning dynamical systems.
LGApr 21, 2020
Deep Learning for Time Series Forecasting: Tutorial and Literature SurveyKonstantinos Benidis, Syama Sundar Rangapuram, Valentin Flunkert et al.
Deep learning based forecasting methods have become the methods of choice in many applications of time series prediction or forecasting often outperforming other approaches. Consequently, over the last years, these methods are now ubiquitous in large-scale industrial forecasting applications and have consistently ranked among the best entries in forecasting competitions (e.g., M4 and M5). This practical success has further increased the academic interest to understand and improve deep forecasting methods. In this article we provide an introduction and overview of the field: We present important building blocks for deep forecasting in some depth; using these building blocks, we then survey the breadth of the recent deep forecasting literature.