DSMar 23

Optimal-Time Move Structure Balancing and LCP Array Computation from the RLBWT

arXiv:2603.2214776.81 citationsh-index: 21
AI Analysis

This improves efficiency for researchers and practitioners working with repetitive text collections in bioinformatics or data compression, though it is incremental as it optimizes an existing bottleneck.

The paper tackles the problem of the O(r log r)-time bottleneck in move structure balancing for RLBWT-based algorithms, presenting an optimal O(r)-time and space algorithm for balancing, which leads to an optimal O(n)-time algorithm for LCP array computation in O(r)-space.

On repetitive text collections of size $n$, the Burrows-Wheeler Transform (BWT) tends to have relatively fewer runs $r$ in its run-length encoded BWT (RLBWT). This motivates many RLBWT-related algorithms and data structures that can be designed in compressed $O(r)$-space. These approaches often use the RLBWT-derived permutations LF, FL, $ϕ$, and $ϕ^{-1}$, which can be represented using a move structure to obtain optimal $O(1)$-time for each permutation step in $O(r)$-space. They are then used to construct compressed space text indexes supporting efficient pattern matching queries. However, move structure construction in $O(r)$-space requires an $O(r \log r)$-time balancing stage. The longest common prefix array (LCP) of a text collection is used to support pattern matching queries and data structure construction. Recently, it was shown how to compute the LCP array in $O(n + r \log r)$-time and $O(r)$ additional space from an RLBWT. However, the bottleneck remains the $O(r \log r)$-time move structure balancing stage. In this paper, we describe an optimal $O(r)$-time and space algorithm to balance a move structure. This result is then applied to LCP construction from an RLBWT to obtain an optimal $O(n)$-time algorithm in $O(r)$-space in addition to the output, which implies an optimal-time algorithm for LCP array enumeration in compressed $O(r)$-space.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes