CVJul 7, 2024

DTR: A Unified Deep Tensor Representation Framework for Multimedia Data Recovery

Ting-Wei Zhou, Xi-Le Zhao, Jian-Li Wang, Yi-Si Luo, Min Wang, Xiao-Xuan Bai, Hong Yan

arXiv:2407.05267v12.02 citationsh-index: 8

Originality Highly original

AI Analysis

This work addresses the underexplored characterization aspect in transform-based tensor representations for multimedia data recovery, offering a novel framework that enhances recovery quality, particularly for fine details, which is important for applications in image and video processing.

The authors tackled the problem of multimedia data recovery by proposing a unified Deep Tensor Representation (DTR) framework that combines deep latent generative and transform modules to better characterize transformed tensors, achieving superior performance in multi-dimensional data recovery tasks with improvements in fine details recovery compared to state-of-the-art methods.

Recently, the transform-based tensor representation has attracted increasing attention in multimedia data (e.g., images and videos) recovery problems, which consists of two indispensable components, i.e., transform and characterization. Previously, the development of transform-based tensor representation mainly focuses on the transform aspect. Although several attempts consider using shallow matrix factorization (e.g., singular value decomposition and negative matrix factorization) to characterize the frontal slices of transformed tensor (termed as latent tensor), the faithful characterization aspect is underexplored. To address this issue, we propose a unified Deep Tensor Representation (termed as DTR) framework by synergistically combining the deep latent generative module and the deep transform module. Especially, the deep latent generative module can faithfully generate the latent tensor as compared with shallow matrix factorization. The new DTR framework not only allows us to better understand the classic shallow representations, but also leads us to explore new representation. To examine the representation ability of the proposed DTR, we consider the representative multi-dimensional data recovery task and suggest an unsupervised DTR-based multi-dimensional data recovery model. Extensive experiments demonstrate that DTR achieves superior performance compared to state-of-the-art methods in both quantitative and qualitative aspects, especially for fine details recovery.

View on arXiv PDF

Similar