DSDCLGJun 20, 2022

Scalable Distributed Algorithms for Size-Constrained Submodular Maximization in the MapReduce and Adaptive Complexity Models

arXiv:2206.09563v62 citationsh-index: 17
Originality Incremental advance
AI Analysis

This work addresses scalable optimization for large-scale data processing, but it is incremental as it builds on existing frameworks and algorithms.

The paper tackled the problem of distributed maximization of monotone submodular functions under size constraints by showing that sublinearly adaptive algorithms can be integrated into the MapReduce model, yielding practical and parallelizable solutions, and separately developed the first distributed algorithm with linear query complexity.

Distributed maximization of a submodular function in the MapReduce (MR) model has received much attention, culminating in two frameworks that allow a centralized algorithm to be run in the MR setting without loss of approximation, as long as the centralized algorithm satisfies a certain consistency property -- which had previously only been known to be satisfied by the standard greedy and continous greedy algorithms. A separate line of work has studied parallelizability of submodular maximization in the adaptive complexity model, where each thread may have access to the entire ground set. For the size-constrained maximization of a monotone and submodular function, we show that several sublinearly adaptive (highly parallelizable) algorithms satisfy the consistency property required to work in the MR setting, which yields practical, parallelizable and distributed algorithms. Separately, we develop the first distributed algorithm with linear query complexity for this problem. Finally, we provide a method to increase the maximum cardinality constraint for MR algorithms at the cost of additional MR rounds.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes