IT LGMar 11, 2025

MinGRU-Based Encoder for Turbo Autoencoder Frameworks

arXiv:2503.08451v12.31 citationsh-index: 102025 IEEE International Conference on Machine Learning for Communication and Networking (ICMLCN)

Originality Incremental advance

AI Analysis

This work addresses scalability challenges in neural channel coding for communication systems, representing an incremental improvement by combining existing efficient RNN architectures.

The paper tackled the scalability and efficiency issues of neural channel coding for long sequences by integrating a lightweight minGRU model with a Mamba block into a Turbo autoencoder framework, achieving performance matching convolutional methods for short sequences and significant improvements in scalability and training efficiency for long block lengths.

Early neural channel coding approaches leveraged dense neural networks with one-hot encodings to design adaptive encoder-decoder pairs, improving block error rate (BLER) and automating the design process. However, these methods struggled with scalability as the size of message sets and block lengths increased. TurboAE addressed this challenge by focusing on bit-sequence inputs rather than symbol-level representations, transforming the scalability issue associated with large message sets into a sequence modeling problem. While recurrent neural networks (RNNs) were a natural fit for sequence processing, their reliance on sequential computations made them computationally expensive and inefficient for long sequences. As a result, TurboAE adopted convolutional network blocks, which were faster to train and more scalable, but lacked the sequential modeling advantages of RNNs. Recent advances in efficient RNN architectures, such as minGRU and minLSTM, and structured state space models (SSMs) like S4 and S6, overcome these limitations by significantly reducing memory and computational overhead. These models enable scalable sequence processing, making RNNs competitive for long-sequence tasks. In this work, we revisit RNNs for Turbo autoencoders by integrating the lightweight minGRU model with a Mamba block from SSMs into a parallel Turbo autoencoder framework. Our results demonstrate that this hybrid design matches the performance of convolutional network-based Turbo autoencoder approaches for short sequences while significantly improving scalability and training efficiency for long block lengths. This highlights the potential of efficient RNNs in advancing neural channel coding for long-sequence scenarios.

View on arXiv PDF

Similar