Discriminative Feature Representation with Spatio-temporal Cues for Vehicle Re-identification
This work is significant for improving vehicle re-identification accuracy, which is crucial for security surveillance and traffic management applications, by addressing the challenges of similar vehicle appearances and varying environmental conditions.
The paper addresses vehicle re-identification by proposing a novel discriminative feature representation (DFR-ST) that integrates appearance and spatio-temporal information. This approach, utilizing a two-stream architecture for multi-grained visual representation and a spatio-temporal metric, achieved state-of-the-art performance on two public datasets.
Vehicle re-identification (re-ID) aims to discover and match the target vehicles from a gallery image set taken by different cameras on a wide range of road networks. It is crucial for lots of applications such as security surveillance and traffic management. The remarkably similar appearances of distinct vehicles and the significant changes of viewpoints and illumination conditions take grand challenges to vehicle re-ID. Conventional solutions focus on designing global visual appearances without sufficient consideration of vehicles' spatiotamporal relationships in different images. In this paper, we propose a novel discriminative feature representation with spatiotemporal clues (DFR-ST) for vehicle re-ID. It is capable of building robust features in the embedding space by involving appearance and spatio-temporal information. Based on this multi-modal information, the proposed DFR-ST constructs an appearance model for a multi-grained visual representation by a two-stream architecture and a spatio-temporal metric to provide complementary information. Experimental results on two public datasets demonstrate DFR-ST outperforms the state-of-the-art methods, which validate the effectiveness of the proposed method.