LGJun 11, 2024

Low Rank Multi-Dictionary Selection at Scale

Boya Ma, Maxwell McNeil, Abram Magner, Petko Bogdanov

arXiv:2406.06960v12.6

Originality Incremental advance

AI Analysis

This addresses a scalability bottleneck for researchers and practitioners using multi-dictionary coding in domains like image and signal processing, though it is incremental as it builds on existing sparse coding frameworks.

The paper tackles the scalability challenge of multi-dictionary sparse coding for large datasets and dictionaries by proposing LRMDS, a technique that selects atom pairs based on data alignment and uses convex relaxation, achieving 3X to 10X speed-up and up to 100X improvement in representation quality compared to baselines.

The sparse dictionary coding framework represents signals as a linear combination of a few predefined dictionary atoms. It has been employed for images, time series, graph signals and recently for 2-way (or 2D) spatio-temporal data employing jointly temporal and spatial dictionaries. Large and over-complete dictionaries enable high-quality models, but also pose scalability challenges which are exacerbated in multi-dictionary settings. Hence, an important problem that we address in this paper is: How to scale multi-dictionary coding for large dictionaries and datasets? We propose a multi-dictionary atom selection technique for low-rank sparse coding named LRMDS. To enable scalability to large dictionaries and datasets, it progressively selects groups of row-column atom pairs based on their alignment with the data and performs convex relaxation coding via the corresponding sub-dictionaries. We demonstrate both theoretically and experimentally that when the data has a low-rank encoding with a sparse subset of the atoms, LRMDS is able to select them with strong guarantees under mild assumptions. Furthermore, we demonstrate the scalability and quality of LRMDS in both synthetic and real-world datasets and for a range of coding dictionaries. It achieves 3X to 10X speed-up compared to baselines, while obtaining up to two orders of magnitude improvement in representation quality on some of the real world datasets given a fixed target number of atoms.

View on arXiv PDF

Similar