CVARIVApr 22, 2022

Real-time HOG+SVM based object detection using SoC FPGA for a UHD video stream

arXiv:2204.10619v113 citationsh-index: 15
Originality Synthesis-oriented
AI Analysis

This addresses the need for energy-efficient, real-time object detection in embedded systems like ADAS and AVSS, though it is incremental as it uses an existing method on new hardware.

The paper tackled real-time pedestrian detection in 4K video streams by implementing a HOG+SVM detector on an FPGA, achieving processing at 60 frames per second with a single-scale approach.

Object detection is an essential component of many vision systems. For example, pedestrian detection is used in advanced driver assistance systems (ADAS) and advanced video surveillance systems (AVSS). Currently, most detectors use deep convolutional neural networks (e.g., the YOLO -- You Only Look Once -- family), which, however, due to their high computational complexity, are not able to process a very high-resolution video stream in real-time, especially within a limited energy budget. In this paper we present a hardware implementation of the well-known pedestrian detector with HOG (Histogram of Oriented Gradients) feature extraction and SVM (Support Vector Machine) classification. Our system running on AMD Xilinx Zynq UltraScale+ MPSoC (Multiprocessor System on Chip) device allows real-time processing of 4K resolution (UHD -- Ultra High Definition, 3840 x 2160 pixels) video for 60 frames per second. The system is capable of detecting a pedestrian in a single scale. The results obtained confirm the high suitability of reprogrammable devices in the real-time implementation of embedded vision systems.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes