LGAIApr 23, 2024

Large-Scale Multipurpose Benchmark Datasets For Assessing Data-Driven Deep Learning Approaches For Water Distribution Networks

arXiv:2404.15386v13 citationsh-index: 4The 3rd International Joint Conference on Water Distribution Systems Analysis & Computing and Control for the Water Industry (WDSA/CCWI 2024)
Originality Synthesis-oriented
AI Analysis

This addresses a data availability problem for researchers and practitioners in water infrastructure management, though it is incremental as it compiles existing data rather than introducing new methods.

The authors tackled the lack of ready-to-use benchmark datasets for data-driven deep learning in water distribution networks by providing a collection of datasets, including 1,394,400 hours of operational data from multiple networks.

Currently, the number of common benchmark datasets that researchers can use straight away for assessing data-driven deep learning approaches is very limited. Most studies provide data as configuration files. It is still up to each practitioner to follow a particular data generation method and run computationally intensive simulations to obtain usable data for model training and evaluation. In this work, we provide a collection of datasets that includes several small and medium size publicly available Water Distribution Networks (WDNs), including Anytown, Modena, Balerma, C-Town, D-Town, L-Town, Ky1, Ky6, Ky8, and Ky13. In total 1,394,400 hours of WDNs data operating under normal conditions is made available to the community.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes