CV · AI · RO · Jul 15, 2018

Is the Pedestrian going to Cross? Answering by 2D Pose Estimation

arXiv:1807.10580v1 · 142 citations
Originality: Synthesis-oriented
AI Analysis

This work addresses pedestrian safety for autonomous driving systems, but it is incremental as it applies an existing method to a new dataset.

The paper tackles the problem of predicting pedestrian crossing intentions using 2D pose estimation from monocular images, achieving new state-of-the-art results on the JAAD dataset.

Our recent work suggests that, thanks to today's powerful CNNs, image-based 2D pose estimation is a promising cue for determining pedestrian intentions such as crossing the road in the path of the ego-vehicle, stopping before entering the road, and starting to walk or bending towards the road. This claim rests on results obtained on non-naturalistic sequences (the Daimler dataset), i.e. sequences choreographed specifically for the study. Fortunately, a new publicly available dataset (JAAD) has recently appeared that allows developing methods for detecting pedestrian intentions in naturalistic driving conditions and, more specifically, for addressing the relevant question: is the pedestrian going to cross? Accordingly, in this paper we use JAAD to assess the usefulness of 2D pose estimation for answering this question. We combine CNN-based pedestrian detection, tracking and pose estimation to predict the crossing action from monocular images. Overall, the proposed pipeline provides new state-of-the-art results.
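
To make the last stage of such a pipeline concrete, here is a minimal sketch of how per-frame 2D keypoints from a tracked pedestrian might be turned into a fixed-length descriptor for a crossing / not-crossing classifier. It is an illustration under stated assumptions, not the authors' implementation: the function names `pose_features` and `window_descriptor`, the choice of pairwise keypoint distances, the normalisation by skeleton height, and the window length are all hypothetical. Detection, tracking and pose estimation are assumed to have already produced the keypoints.

```python
import math
from typing import List, Sequence, Tuple

Keypoint = Tuple[float, float]   # (x, y) in image pixels
Pose = Sequence[Keypoint]        # one 2D skeleton per frame


def pose_features(pose: Pose) -> List[float]:
    """Pairwise keypoint distances for one frame, divided by skeleton height.

    Dividing by the vertical extent of the skeleton (an assumed choice)
    makes the features roughly independent of the pedestrian's image size.
    """
    ys = [p[1] for p in pose]
    height = max(ys) - min(ys)
    scale = height if height > 0 else 1.0
    feats = []
    for i in range(len(pose)):
        for j in range(i + 1, len(pose)):
            dx = pose[i][0] - pose[j][0]
            dy = pose[i][1] - pose[j][1]
            feats.append(math.hypot(dx, dy) / scale)
    return feats


def window_descriptor(track: Sequence[Pose], window: int) -> List[float]:
    """Concatenate per-frame features over the last `window` frames of a track."""
    if len(track) < window:
        raise ValueError("track is shorter than the requested window")
    descriptor: List[float] = []
    for pose in track[-window:]:
        descriptor.extend(pose_features(pose))
    return descriptor
```

The resulting descriptor could then be passed to any binary classifier trained on crossing / not-crossing labels. Concatenating frames rather than averaging them keeps the temporal ordering of the poses, which is the information a cue such as "starting to walk" depends on.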
