CVAILGJun 20, 2024

Seg-LSTM: Performance of xLSTM for Semantic Segmentation of Remotely Sensed Images

arXiv:2406.14086v112 citationsHas Code
Originality Synthesis-oriented
AI Analysis

This is an incremental study assessing a new application of an existing method (xLSTM) to a domain-specific task (remote sensing segmentation), with negative results.

The study evaluated Vision-LSTM for semantic segmentation of remotely sensed images using a Seg-LSTM architecture and found its performance limited, generally inferior to Vision-Transformers and Vision-Mamba models in most tests.

Recent advancements in autoregressive networks with linear complexity have driven significant research progress, demonstrating exceptional performance in large language models. A representative model is the Extended Long Short-Term Memory (xLSTM), which incorporates gating mechanisms and memory structures, performing comparably to Transformer architectures in long-sequence language tasks. Autoregressive networks such as xLSTM can utilize image serialization to extend their application to visual tasks such as classification and segmentation. Although existing studies have demonstrated Vision-LSTM's impressive results in image classification, its performance in image semantic segmentation remains unverified. Our study represents the first attempt to evaluate the effectiveness of Vision-LSTM in the semantic segmentation of remotely sensed images. This evaluation is based on a specifically designed encoder-decoder architecture named Seg-LSTM, and comparisons with state-of-the-art segmentation networks. Our study found that Vision-LSTM's performance in semantic segmentation was limited and generally inferior to Vision-Transformers-based and Vision-Mamba-based models in most comparative tests. Future research directions for enhancing Vision-LSTM are recommended. The source code is available from https://github.com/zhuqinfeng1999/Seg-LSTM.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes