LGSYMay 6, 2022

Application of Clustering Algorithms for Dimensionality Reduction in Infrastructure Resilience Prediction Models

arXiv:2205.03316v111 citationsh-index: 9
Originality Synthesis-oriented
AI Analysis

This work addresses the problem of high-dimensional data in infrastructure resilience prediction for decision-makers, but it is incremental as it applies existing clustering techniques to a specific domain.

The paper tackles the curse of dimensionality in machine learning models for predicting infrastructure resilience by proposing a clustering-based method to reduce features, which improves prediction accuracy in a testbed of interdependent power-water-transport networks.

Recent studies increasingly adopt simulation-based machine learning (ML) models to analyze critical infrastructure system resilience. For realistic applications, these ML models consider the component-level characteristics that influence the network response during emergencies. However, such an approach could result in a large number of features and cause ML models to suffer from the `curse of dimensionality'. We present a clustering-based method that simultaneously minimizes the problem of high-dimensionality and improves the prediction accuracy of ML models developed for resilience analysis in large-scale interdependent infrastructure networks. The methodology has three parts: (a) generation of simulation dataset, (b) network component clustering, and (c) dimensionality reduction and development of prediction models. First, an interdependent infrastructure simulation model simulates the network-wide consequences of various disruptive events. The component-level features are extracted from the simulated data. Next, clustering algorithms are used to derive the cluster-level features by grouping component-level features based on their topological and functional characteristics. Finally, ML algorithms are used to develop models that predict the network-wide impacts of disruptive events using the cluster-level features. The applicability of the method is demonstrated using an interdependent power-water-transport testbed. The proposed method can be used to develop decision-support tools for post-disaster recovery of infrastructure networks.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes