LGMLApr 9, 2018

Anomaly Detection for Industrial Big Data

arXiv:1804.02998v11 citations
Originality Synthesis-oriented
AI Analysis

This addresses the need for scalable data-driven methods in industrial condition monitoring, but it appears incremental as it builds on existing data-driven paradigms without claiming major breakthroughs.

The paper tackles the challenge of analyzing Industrial Big Data from IIoT sensors for anomaly detection, presenting a prototype technique that generalizes to multivariate datasets using bootstrapped partitions and ordinal distances.

As the Industrial Internet of Things (IIoT) grows, systems are increasingly being monitored by arrays of sensors returning time-series data at ever-increasing 'volume, velocity and variety' (i.e. Industrial Big Data). An obvious use for these data is real-time systems condition monitoring and prognostic time to failure analysis (remaining useful life, RUL). (e.g. See white papers by Senseye.io, and output of the NASA Prognostics Center of Excellence (PCoE).) However, as noted by Agrawal and Choudhary 'Our ability to collect "big data" has greatly surpassed our capability to analyze it, underscoring the emergence of the fourth paradigm of science, which is data-driven discovery.' In order to fully utilize the potential of Industrial Big Data we need data-driven techniques that operate at scales that process models cannot. Here we present a prototype technique for data-driven anomaly detection to operate at industrial scale. The method generalizes to application with almost any multivariate dataset based on independent ordinations of repeated (bootstrapped) partitions of the dataset and inspection of the joint distribution of ordinal distances.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes