LGDBJul 17, 2023

LearnedSort as a learning-augmented SampleSort: Analysis and Parallelization

arXiv:2307.08637v16 citationsh-index: 3
Originality Incremental advance
AI Analysis

This work addresses the problem of efficient parallel sorting for large datasets, presenting an incremental improvement over existing methods.

The paper analyzed LearnedSort as a learning-augmented SampleSort and developed a parallel version by combining it with IPS4o, showing improved parallel performance on synthetic and real-world datasets compared to IPS4o and other algorithms.

This work analyzes and parallelizes LearnedSort, the novel algorithm that sorts using machine learning models based on the cumulative distribution function. LearnedSort is analyzed under the lens of algorithms with predictions, and it is argued that LearnedSort is a learning-augmented SampleSort. A parallel LearnedSort algorithm is developed combining LearnedSort with the state-of-the-art SampleSort implementation, IPS4o. Benchmarks on synthetic and real-world datasets demonstrate improved parallel performance for parallel LearnedSort compared to IPS4o and other sorting algorithms.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes