APLGMLAug 30, 2024

Data is missing again -- Reconstruction of power generation data using $k$-Nearest Neighbors and spectral graph theory

arXiv:2409.00300v11 citationsh-index: 2
Originality Incremental advance
AI Analysis

This work addresses data imputation for wind farm operators, offering an incremental improvement by integrating expert knowledge into data-driven techniques.

The paper tackles the problem of missing power generation data in wind farms by proposing an imputation method that combines k-Nearest Neighbors with spectral graph theory to incorporate wind farm geometry, resulting in significant improvement over methods ignoring layout information as demonstrated on the Westermost Rough offshore wind farm.

The risk of missing data and subsequent incomplete data records at wind farms increases with the number of turbines and sensors. We propose here an imputation method that blends data-driven concepts with expert knowledge, by using the geometry of the wind farm in order to provide better estimates when performing Nearest Neighbor imputation. Our method relies on learning Laplacian eigenmaps out of the graph of the wind farm through spectral graph theory. These learned representations can be based on the wind farm layout only, or additionally account for information provided by collected data. The related weighted graph is allowed to change with time and can be tracked in an online fashion. Application to the Westermost Rough offshore wind farm shows significant improvement over approaches that do not account for the wind farm layout information.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes