SPCVJun 19, 2023

To Fold or Not to Fold: Graph Regularized Tensor Train for Visual Data Completion

arXiv:2306.11123v23 citationsh-index: 37
Originality Incremental advance
AI Analysis

This work addresses a specific bottleneck in tensor-based visual data completion for researchers in computer vision and tensor analysis.

The paper tackles the problem of local information loss in tensor train completion for visual data by exploring an unfolded approach with graph regularization to preserve local similarity, achieving superior performance on both synthetic and real-world visual data.

Tensor train (TT) representation has achieved tremendous success in visual data completion tasks, especially when it is combined with tensor folding. However, folding an image or video tensor breaks the original data structure, leading to local information loss as nearby pixels may be assigned into different dimensions and become far away from each other. In this paper, to fully preserve the local information of the original visual data, we explore not folding the data tensor, and at the same time adopt graph information to regularize local similarity between nearby entries. To overcome the high computational complexity introduced by the graph-based regularization in the TT completion problem, we propose to break the original problem into multiple sub-problems with respect to each TT core fiber, instead of each TT core as in traditional methods. Furthermore, to avoid heavy parameter tuning, a sparsity promoting probabilistic model is built based on the generalized inverse Gaussian (GIG) prior, and an inference algorithm is derived under the mean-field approximation. Experiments on both synthetic data and real-world visual data show the superiority of the proposed methods.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes