DC LGAug 9, 2017

Enabling Massive Deep Neural Networks with the GraphBLAS

Jeremy Kepner, Manoj Kumar, José Moreira, Pratap Pattnaik, Mauricio Serrano, Henry Tufo

arXiv:1708.02937v133 citations

Originality Synthesis-oriented

AI Analysis

This work addresses memory efficiency for researchers and practitioners scaling DNNs, but it is incremental as it applies an existing library to a known problem.

The paper tackles the challenge of memory limitations in large deep neural networks by using the GraphBLAS library to handle sparse weight matrices, showing that the sparse implementation outperforms dense BLAS as matrices become sparser.

Deep Neural Networks (DNNs) have emerged as a core tool for machine learning. The computations performed during DNN training and inference are dominated by operations on the weight matrices describing the DNN. As DNNs incorporate more stages and more nodes per stage, these weight matrices may be required to be sparse because of memory limitations. The GraphBLAS.org math library standard was developed to provide high performance manipulation of sparse weight matrices and input/output vectors. For sufficiently sparse matrices, a sparse matrix library requires significantly less memory than the corresponding dense matrix implementation. This paper provides a brief description of the mathematics underlying the GraphBLAS. In addition, the equations of a typical DNN are rewritten in a form designed to use the GraphBLAS. An implementation of the DNN is given using a preliminary GraphBLAS C library. The performance of the GraphBLAS implementation is measured relative to a standard dense linear algebra library implementation. For various sizes of DNN weight matrices, it is shown that the GraphBLAS sparse implementation outperforms a BLAS dense implementation as the weight matrix becomes sparser.

View on arXiv PDF

Similar