AISep 20, 2023

A Comprehensive Survey on Rare Event Prediction

arXiv:2309.11356v263 citationsh-index: 105
Originality Synthesis-oriented
AI Analysis

It addresses the problem of predicting low-probability events for practitioners and researchers in fields like Industry 4.0, but it is incremental as a survey paper.

This paper surveys rare event prediction in machine learning, reviewing 73 datasets and categorizing approaches across data processing, algorithms, and evaluation to identify gaps and suggest future research directions.

Rare event prediction involves identifying and forecasting events with a low probability using machine learning (ML) and data analysis. Due to the imbalanced data distributions, where the frequency of common events vastly outweighs that of rare events, it requires using specialized methods within each step of the ML pipeline, i.e., from data processing to algorithms to evaluation protocols. Predicting the occurrences of rare events is important for real-world applications, such as Industry 4.0, and is an active research area in statistical and ML. This paper comprehensively reviews the current approaches for rare event prediction along four dimensions: rare event data, data processing, algorithmic approaches, and evaluation approaches. Specifically, we consider 73 datasets from different modalities (i.e., numerical, image, text, and audio), four major categories of data processing, five major algorithmic groupings, and two broader evaluation approaches. This paper aims to identify gaps in the current literature and highlight the challenges of predicting rare events. It also suggests potential research directions, which can help guide practitioners and researchers.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes