SP CVJun 19, 2023

To Fold or Not to Fold: Graph Regularized Tensor Train for Visual Data Completion

Le Xu, Lei Cheng, Ngai Wong, Yik-Chung Wu

arXiv:2306.11123v23.33 citationsh-index: 37Has Code

Originality Incremental advance

AI Analysis

This work addresses a specific bottleneck in tensor-based visual data completion for researchers in computer vision and tensor analysis.

The paper tackles the problem of local information loss in tensor train completion for visual data by exploring an unfolded approach with graph regularization to preserve local similarity, achieving superior performance on both synthetic and real-world visual data.

Tensor train (TT) representation has achieved tremendous success in visual data completion tasks, especially when it is combined with tensor folding. However, folding an image or video tensor breaks the original data structure, leading to local information loss as nearby pixels may be assigned into different dimensions and become far away from each other. In this paper, to fully preserve the local information of the original visual data, we explore not folding the data tensor, and at the same time adopt graph information to regularize local similarity between nearby entries. To overcome the high computational complexity introduced by the graph-based regularization in the TT completion problem, we propose to break the original problem into multiple sub-problems with respect to each TT core fiber, instead of each TT core as in traditional methods. Furthermore, to avoid heavy parameter tuning, a sparsity promoting probabilistic model is built based on the generalized inverse Gaussian (GIG) prior, and an inference algorithm is derived under the mean-field approximation. Experiments on both synthetic data and real-world visual data show the superiority of the proposed methods.

View on arXiv PDF Code

Similar