CVMay 23, 2024

Deep Convolutional Neural Networks Meet Variational Shape Compactness Priors for Image Segmentation

arXiv:2406.19400v17 citationsh-index: 5Neurocomputing
Originality Incremental advance
AI Analysis

This work addresses computational inefficiency and hyperparameter tuning in image segmentation for noisy datasets, offering incremental improvements in deep learning methods.

The paper tackles the problem of image segmentation by incorporating shape compactness priors, proposing two novel optimization algorithms that improve computational efficiency and segmentation accuracy, achieving a 20% increase in IoU on noisy iris datasets.

Shape compactness is a key geometrical property to describe interesting regions in many image segmentation tasks. In this paper, we propose two novel algorithms to solve the introduced image segmentation problem that incorporates a shape-compactness prior. Existing algorithms for such a problem often suffer from computational inefficiency, difficulty in reaching a local minimum, and the need to fine-tune the hyperparameters. To address these issues, we propose a novel optimization model along with its equivalent primal-dual model and introduce a new optimization algorithm based on primal-dual threshold dynamics (PD-TD). Additionally, we relax the solution constraint and propose another novel primal-dual soft threshold-dynamics algorithm (PD-STD) to achieve superior performance. Based on the variational explanation of the sigmoid layer, the proposed PD-STD algorithm can be integrated into Deep Neural Networks (DNNs) to enforce compact regions as image segmentation results. Compared to existing deep learning methods, extensive experiments demonstrated that the proposed algorithms outperformed state-of-the-art algorithms in numerical efficiency and effectiveness, especially while applying to the popular networks of DeepLabV3 and IrisParseNet with higher IoU, dice, and compactness metrics on noisy Iris datasets. In particular, the proposed algorithms significantly improve IoU by 20% training on a highly noisy image dataset.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes