CEAIPFAug 4, 2022

TunaOil: A Tuning Algorithm Strategy for Reservoir Simulation Workloads

arXiv:2208.02606v18 citationsh-index: 21
Originality Synthesis-oriented
AI Analysis

This work addresses a domain-specific problem for the oil and gas industry by reducing computational costs in reservoir simulation workflows, though it appears incremental as it builds on existing ensemble and machine learning techniques.

The paper tackles the problem of optimizing numerical parameters for reservoir simulations in the oil and gas industry, which is computationally expensive, and presents TunaOil, a methodology that uses a performance model to enhance this search, resulting in an average 31% improvement in workflow runtime.

Reservoir simulations for petroleum fields and seismic imaging are known as the most demanding workloads for high-performance computing (HPC) in the oil and gas (O&G) industry. The optimization of the simulator numerical parameters plays a vital role as it could save considerable computational efforts. State-of-the-art optimization techniques are based on running numerous simulations, specific for that purpose, to find good parameter candidates. However, using such an approach is highly costly in terms of time and computing resources. This work presents TunaOil, a new methodology to enhance the search for optimal numerical parameters of reservoir flow simulations using a performance model. In the O&G industry, it is common to use ensembles of models in different workflows to reduce the uncertainty associated with forecasting O&G production. We leverage the runs of those ensembles in such workflows to extract information from each simulation and optimize the numerical parameters in their subsequent runs. To validate the methodology, we implemented it in a history matching (HM) process that uses a Kalman filter algorithm to adjust an ensemble of reservoir models to match the observed data from the real field. We mine past execution logs from many simulations with different numerical configurations and build a machine learning model based on extracted features from the data. These features include properties of the reservoir models themselves, such as the number of active cells, to statistics of the simulation's behavior, such as the number of iterations of the linear solver. A sampling technique is used to query the oracle to find the numerical parameters that can reduce the elapsed time without significantly impacting the quality of the results. Our experiments show that the predictions can improve the overall HM workflow runtime on average by 31%.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes