CVJun 23, 2022
BlazePose GHUM Holistic: Real-time 3D Human Landmarks and Pose EstimationIvan Grishchenko, Valentin Bazarevsky, Andrei Zanfir et al.
We present BlazePose GHUM Holistic, a lightweight neural network pipeline for 3D human body landmarks and pose estimation, specifically tailored to real-time on-device inference. BlazePose GHUM Holistic enables motion capture from a single RGB image including avatar control, fitness tracking and AR/VR effects. Our main contributions include i) a novel method for 3D ground truth data acquisition, ii) updated 3D body tracking with additional hand landmarks and iii) full body pose estimation from a monocular image.
CVJun 19, 2020
Real-time Pupil Tracking from Monocular Video for Digital PuppetryArtsiom Ablavatski, Andrey Vakunov, Ivan Grishchenko et al.
We present a simple, real-time approach for pupil tracking from live video on mobile devices. Our method extends a state-of-the-art face mesh detector with two new components: a tiny neural network that predicts positions of the pupils in 2D, and a displacement-based estimation of the pupil blend shape coefficients. Our technique can be used to accurately control the pupil movements of a virtual puppet, and lends liveliness and energy to it. The proposed approach runs at over 50 FPS on modern phones, and enables its usage in any real-time puppeteering pipeline.