CRLGAug 26, 2020

SIGL: Securing Software Installations Through Deep Graph Learning

arXiv:2008.11533v283 citations
AI Analysis

This addresses supply-chain security for software users by providing a robust detection tool, though it is an incremental improvement over existing methods.

The paper tackles the problem of detecting malicious behavior during software installation by introducing SIGL, a tool that uses deep graph learning on system call traces, achieving 96% detection accuracy on a test corpus of 625 malicious installers and outperforming existing systems by up to 87% in precision and recall.

Many users implicitly assume that software can only be exploited after it is installed. However, recent supply-chain attacks demonstrate that application integrity must be ensured during installation itself. We introduce SIGL, a new tool for detecting malicious behavior during software installation. SIGL collects traces of system call activity, building a data provenance graph that it analyzes using a novel autoencoder architecture with a graph long short-term memory network (graph LSTM) for the encoder and a standard multilayer perceptron for the decoder. SIGL flags suspicious installations as well as the specific installation-time processes that are likely to be malicious. Using a test corpus of 625 malicious installers containing real-world malware, we demonstrate that SIGL has a detection accuracy of 96%, outperforming similar systems from industry and academia by up to 87% in precision and recall and 45% in accuracy. We also demonstrate that SIGL can pinpoint the processes most likely to have triggered malicious behavior, works on different audit platforms and operating systems, and is robust to training data contamination and adversarial attack. It can be used with application-specific models, even in the presence of new software versions, as well as application-agnostic meta-models that encompass a wide range of applications and installers.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes