CVOct 8, 2022

Towards Light Weight Object Detection System

arXiv:2210.03861v11 citationsh-index: 3
Originality Incremental advance
AI Analysis

This work addresses efficiency challenges for deploying object detection in resource-constrained environments, presenting incremental improvements.

The paper tackles the high latency of transformers in lightweight object detection by approximating self-attention layers to reduce latency with minimal accuracy loss, and introduces a transformer encoder for multi-resolution feature fusion to improve accuracy without adding many parameters.

Transformers are a popular choice for classification tasks and as backbones for object detection tasks. However, their high latency brings challenges in their adaptation to lightweight object detection systems. We present an approximation of the self-attention layers used in the transformer architecture. This approximation reduces the latency of the classification system while incurring minimal loss in accuracy. We also present a method that uses a transformer encoder layer for multi-resolution feature fusion. This feature fusion improves the accuracy of the state-of-the-art lightweight object detection system without significantly increasing the number of parameters. Finally, we provide an abstraction for the transformer architecture called Generalized Transformer (gFormer) that can guide the design of novel transformer-like architectures.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes