ROCVJul 12, 2022

Robotic Detection of a Human-Comprehensible Gestural Language for Underwater Multi-Human-Robot Collaboration

arXiv:2207.05331v19 citationsh-index: 28
AI Analysis

This addresses the challenge of underwater multi-human-robot collaboration where traditional communication methods like radio are ineffective, though it is incremental as it builds on existing gesture recognition and AUV communication techniques.

The paper tackles the problem of enabling non-verbal communication between autonomous underwater vehicles (AUVs) and human divers by designing a gestural language and a deep network (RRCommNet) for recognition, achieving accuracies of 88-94% on simulated data, 73-83% on real data, and 88% human transcription accuracy.

In this paper, we present a motion-based robotic communication framework that enables non-verbal communication among autonomous underwater vehicles (AUVs) and human divers. We design a gestural language for AUV-to-AUV communication which can be easily understood by divers observing the conversation unlike typical radio frequency, light, or audio based AUV communication. To allow AUVs to visually understand a gesture from another AUV, we propose a deep network (RRCommNet) which exploits a self-attention mechanism to learn to recognize each message by extracting maximally discriminative spatio-temporal features. We train this network on diverse simulated and real-world data. Our experimental evaluations, both in simulation and in closed-water robot trials, demonstrate that the proposed RRCommNet architecture is able to decipher gesture-based messages with an average accuracy of 88-94% on simulated data, 73-83% on real data (depending on the version of the model used). Further, by performing a message transcription study with human participants, we also show that the proposed language can be understood by humans, with an overall transcription accuracy of 88%. Finally, we discuss the inference runtime of RRCommNet on embedded GPU hardware, for real-time use on board AUVs in the field.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes