Visual-based Positioning and Pose Estimation
This addresses the need for robust visual analysis in applications like sports analytics, but it is incremental as it builds on existing techniques.
The paper tackled the problem of integrating human localization and pose estimation into a robust pipeline to handle errors in video frames, showing that tracking by detection and linear interpolation could deliver good spatial and temporal resolutions for position and pose estimations.
Recent advances in deep learning and computer vision offer an excellent opportunity to investigate high-level visual analysis tasks such as human localization and human pose estimation. Although the performance of human localization and human pose estimation has significantly improved in recent reports, they are not perfect and erroneous localization and pose estimation can be expected among video frames. Studies on the integration of these techniques into a generic pipeline that is robust to noise introduced from those errors are still lacking. This paper fills the missing study. We explored and developed two working pipelines that suited the visual-based positioning and pose estimation tasks. Analyses of the proposed pipelines were conducted on a badminton game. We showed that the concept of tracking by detection could work well, and errors in position and pose could be effectively handled by a linear interpolation technique using information from nearby frames. The results showed that the Visual-based Positioning and Pose Estimation could deliver position and pose estimations with good spatial and temporal resolutions.