MMAug 30, 2015

Compressive Video Sensing via Dictionary Learning and Forward Prediction

arXiv:1508.07640v12 citations
Originality Synthesis-oriented
AI Analysis

This is an incremental improvement in compressive video sensing for applications like video compression and transmission.

The authors tackled compressive video sensing by splitting video into key and non-key frames, using dictionary learning for spatial redundancy and forward prediction for temporal redundancy, achieving adequate results as measured by PSNR and SSIM.

In this paper, we propose a new framework for compressive video sensing (CVS) that exploits the inherent spatial and temporal redundancies of a video sequence, effectively. The proposed method splits the video sequence into the key and non-key frames followed by dividing each frame into small non-overlapping blocks of equal sizes. At the decoder side, the key frames are reconstructed using adaptively learned sparsifying (ALS) basis via $\ell_0$ minimization, in order to exploit the spatial redundancy. Also, the effectiveness of three well-known dictionary learning algorithms is investigated in our method. For recovery of the non-key frames, a prediction of the current frame is initialized, by using the previous reconstructed frame, in order to exploit the temporal redundancy. The prediction is employed in a proper optimization problem to recover the current non-key frame. To compare our experimental results with the results of some other methods, we employ peak signal to noise ratio (PSNR) and structural similarity (SSIM) index as the quality assessor. The numerical results show the adequacy of our proposed method in CVS.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes