LGSEMLAug 26, 2020

Hybrid Deep Neural Networks to Infer State Models of Black-Box Systems

arXiv:2008.11856v1
Originality Incremental advance
AI Analysis

This work addresses the challenge of automated software engineering tasks like program comprehension and testing for systems where source code access is limited, offering a practical solution with measurable gains.

The paper tackles the problem of inferring state models from black-box systems without source code instrumentation by proposing a hybrid deep neural network that processes multiple time series signals. The approach improved state change point detection by up to 102% in F1 score and achieved an average 90.45% F1 score for state classification, outperforming traditional methods by up to 17% on a real UAV autopilot system.

Inferring behavior model of a running software system is quite useful for several automated software engineering tasks, such as program comprehension, anomaly detection, and testing. Most existing dynamic model inference techniques are white-box, i.e., they require source code to be instrumented to get run-time traces. However, in many systems, instrumenting the entire source code is not possible (e.g., when using black-box third-party libraries) or might be very costly. Unfortunately, most black-box techniques that detect states over time are either univariate, or make assumptions on the data distribution, or have limited power for learning over a long period of past behavior. To overcome the above issues, in this paper, we propose a hybrid deep neural network that accepts as input a set of time series, one per input/output signal of the system, and applies a set of convolutional and recurrent layers to learn the non-linear correlations between signals and the patterns, over time. We have applied our approach on a real UAV auto-pilot solution from our industry partner with half a million lines of C code. We ran 888 random recent system-level test cases and inferred states, over time. Our comparison with several traditional time series change point detection techniques showed that our approach improves their performance by up to 102%, in terms of finding state change points, measured by F1 score. We also showed that our state classification algorithm provides on average 90.45% F1 score, which improves traditional classification algorithms by up to 17%.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes