DCMSNAPFNAJun 11, 2010

Highly Parallel Sparse Matrix-Matrix Multiplication

arXiv:1006.218329 citationsh-index: 45
Originality Incremental advance
AI Analysis

This work addresses the scalability bottleneck of sparse matrix-matrix multiplication for high-performance computing and graph analytics.

The paper presents the first parallel algorithms for generalized sparse matrix-matrix multiplication that achieve increasing speedups for an unbounded number of processors, with experiments showing scaling up to thousands of processors.

Generalized sparse matrix-matrix multiplication is a key primitive for many high performance graph algorithms as well as some linear solvers such as multigrid. We present the first parallel algorithms that achieve increasing speedups for an unbounded number of processors. Our algorithms are based on two-dimensional block distribution of sparse matrices where serial sections use a novel hypersparse kernel for scalability. We give a state-of-the-art MPI implementation of one of our algorithms. Our experiments show scaling up to thousands of processors on a variety of test scenarios.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes