Cheng Wang

2.0 | CV | Feb 28, 2024

G4G: A Generic Framework for High Fidelity Talking Face Generation with Fine-grained Intra-modal Alignment

Juan Zhang, Jiahao Chen, Cheng Wang et al.

Despite numerous studies, generating high-fidelity talking faces whose lip movements stay tightly synchronized with arbitrary audio remains a significant challenge. The shortcomings of published studies continue to confuse many researchers. This paper introduces G4G, a generic framework for high-fidelity talking face generation with fine-grained intra-modal alignment. G4G reproduces the fidelity of the original video while producing highly synchronized lip movements, regardless of the tone or volume of the given audio. The key to G4G is a diagonal matrix that strengthens the ordinary alignment of audio-image intra-modal features, which significantly increases the comparative learning between positive and negative samples. In addition, a multi-scale supervision module is introduced to reproduce the perceptual fidelity of the original video across the facial region while emphasizing the synchronization between lip movements and the input audio. A fusion network then fuses the facial region with the rest of the image. Experimental results show that G4G both preserves the quality of the original video and produces highly synchronized talking lips. As a generic framework, G4G produces talking videos that are closer to ground truth than those of current state-of-the-art methods.
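
The abstract does not say how the diagonal matrix enters the loss. One common reading, offered here only as a hedged sketch and not as the G4G implementation, is a contrastive objective over a batch similarity matrix in which the diagonal marks the matched audio-image pairs (positives) and every off-diagonal entry is a negative. The function name `diagonal_contrastive_loss`, the `temperature` parameter, and the symmetric cross-entropy form are all assumptions made for illustration.

```python
import numpy as np

def diagonal_contrastive_loss(audio_feats, image_feats, temperature=0.1):
    """Hypothetical symmetric contrastive loss; NOT taken from the G4G paper.

    audio_feats, image_feats: arrays of shape (N, D), where row i of each
    array is assumed to come from the same video frame.
    """
    # L2-normalize each row so dot products are cosine similarities.
    a = audio_feats / np.linalg.norm(audio_feats, axis=1, keepdims=True)
    v = image_feats / np.linalg.norm(image_feats, axis=1, keepdims=True)

    # (N, N) audio-to-image similarity matrix, scaled by the temperature.
    logits = a @ v.T / temperature

    # Identity matrix as the target: diagonal entries are the matched pairs.
    targets = np.eye(len(a))

    def cross_entropy(z):
        z = z - z.max(axis=1, keepdims=True)  # shift for numerical stability
        log_probs = z - np.log(np.exp(z).sum(axis=1, keepdims=True))
        return -(targets * log_probs).sum(axis=1).mean()

    # Average the audio-to-image and image-to-audio directions.
    return 0.5 * (cross_entropy(logits) + cross_entropy(logits.T))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    feats = rng.normal(size=(8, 16))
    print("aligned pairs:   ", diagonal_contrastive_loss(feats, feats))
    print("mismatched pairs:", diagonal_contrastive_loss(feats, feats[::-1]))
```

Under this assumed form, the loss falls as the diagonal similarities grow relative to the off-diagonal ones, which is the sense in which a diagonal target sharpens the separation between positive and negative samples.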

1.2 | NA | Jun 27, 2017

A Second-Order Energy Stable Backward Differentiation Formula Method for the Epitaxial Thin Film Equation with Slope Selection

Wenqiang Feng, Cheng Wang, Steven M. Wise et al.

In this paper, we study a novel second-order energy stable Backward Differentiation Formula (BDF) finite difference scheme for the epitaxial thin film equation with slope selection (SS). One major challenge for higher-order-in-time temporal discretization is how to ensure both unconditional energy stability and an efficient numerical implementation. We propose a general framework for designing higher-order-in-time numerical schemes with unconditional energy stability by using the BDF method with constant-coefficient stabilizing terms. Based on the unconditional energy stability property, we derive an $L^\infty_h (0,T; H_{h}^2)$ stability for the numerical solution and provide an optimal convergence analysis. To deal with the 4-Laplacian solver in an $L^{2}$ gradient flow at each time step, we apply an efficient preconditioned steepest descent algorithm and a preconditioned nonlinear conjugate gradient algorithm to solve the corresponding nonlinear system. Various numerical simulations are presented to demonstrate the stability and efficiency of the proposed schemes and solvers.
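
For orientation, the slope selection model is usually written as the $L^2$ gradient flow of the energy

$$
E(\phi) = \int_\Omega \left( \frac{1}{4}\left(|\nabla\phi|^2 - 1\right)^2 + \frac{\varepsilon^2}{2}\,|\Delta\phi|^2 \right) d\mathbf{x},
\qquad
\partial_t \phi = \nabla\cdot\left(|\nabla\phi|^2\,\nabla\phi\right) - \Delta\phi - \varepsilon^2\Delta^2\phi,
$$

where $\nabla\cdot(|\nabla\phi|^2\nabla\phi)$ is the 4-Laplacian term. A stabilized BDF2 scheme of the kind the abstract describes might then take the following form. This is a sketch under stated assumptions, not a statement of the authors' exact scheme: the time step $s$, the stabilization constant $A$, the explicit extrapolation of the $-\Delta\phi$ term, and the specific stabilizing term are not given in the abstract.

$$
\frac{3\phi^{k+1} - 4\phi^{k} + \phi^{k-1}}{2s}
= \nabla_h\cdot\left(|\nabla_h\phi^{k+1}|^2\,\nabla_h\phi^{k+1}\right)
- \Delta_h\left(2\phi^{k} - \phi^{k-1}\right)
- A\,s\,\Delta_h^2\left(\phi^{k+1} - \phi^{k}\right)
- \varepsilon^2\Delta_h^2\phi^{k+1}.
$$

In this form the left-hand side is the standard second-order BDF approximation of $\partial_t\phi$, the nonlinear 4-Laplacian and surface diffusion terms are treated implicitly, and the assumed term $A\,s\,\Delta_h^2(\phi^{k+1}-\phi^{k})$ is a constant-coefficient correction of order $s^2$, so it would not reduce the formal second-order accuracy.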
