CVNov 6, 2024

Increasing the scalability of graph convolution for FPGA-implemented event-based vision

arXiv:2411.04269v21 citationsh-index: 15FPT
Originality Incremental advance
AI Analysis

This work addresses scalability challenges for deploying GCNNs on FPGAs in mobile robotics and event-based vision, though it is incremental as it builds on existing hardware optimization trends.

The paper tackled the problem of scaling graph convolutional neural networks (GCNNs) for FPGA-implemented event-based vision by optimizing hardware modules to reduce resource usage, achieving up to a 94% reduction in LUT usage for multiplications.

Event cameras are becoming increasingly popular as an alternative to traditional frame-based vision sensors, especially in mobile robotics. Taking full advantage of their high temporal resolution, high dynamic range, low power consumption and sparsity of event data, which only reflects changes in the observed scene, requires both an efficient algorithm and a specialised hardware platform. A recent trend involves using Graph Convolutional Neural Networks (GCNNs) implemented on a heterogeneous SoC FPGA. In this paper we focus on optimising hardware modules for graph convolution to allow flexible selection of the FPGA resource (BlockRAM, DSP and LUT) for their implementation. We propose a ''two-step convolution'' approach that utilises additional BRAM buffers in order to reduce up to 94% of LUT usage for multiplications. This method significantly improves the scalability of GCNNs, enabling the deployment of models with more layers, larger graphs sizes and their application for more dynamic scenarios.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes