SDAIASJul 2, 2025

Real-Time Emergency Vehicle Siren Detection with Efficient CNNs on Embedded Hardware

arXiv:2507.01563v12 citationsh-index: 52025 IEEE 6th International Symposium on the Internet of Sounds (IS2)
Originality Incremental advance
AI Analysis

This work enables distributed acoustic monitoring for emergency vehicle tracking in smart cities using low-cost embedded hardware, though it is incremental as it builds on existing CNNs and datasets.

The paper tackles real-time emergency vehicle siren detection by developing a system based on fine-tuned CNNs and curated datasets, achieving low-latency detection with improved robustness under realistic audio conditions.

We present a full-stack emergency vehicle (EV) siren detection system designed for real-time deployment on embedded hardware. The proposed approach is based on E2PANNs, a fine-tuned convolutional neural network derived from EPANNs, and optimized for binary sound event detection under urban acoustic conditions. A key contribution is the creation of curated and semantically structured datasets - AudioSet-EV, AudioSet-EV Augmented, and Unified-EV - developed using a custom AudioSet-Tools framework to overcome the low reliability of standard AudioSet annotations. The system is deployed on a Raspberry Pi 5 equipped with a high-fidelity DAC+microphone board, implementing a multithreaded inference engine with adaptive frame sizing, probability smoothing, and a decision-state machine to control false positive activations. A remote WebSocket interface provides real-time monitoring and facilitates live demonstration capabilities. Performance is evaluated using both framewise and event-based metrics across multiple configurations. Results show the system achieves low-latency detection with improved robustness under realistic audio conditions. This work demonstrates the feasibility of deploying IoS-compatible SED solutions that can form distributed acoustic monitoring networks, enabling collaborative emergency vehicle tracking across smart city infrastructures through WebSocket connectivity on low-cost edge devices.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes