RONov 11, 2020

FINO-Net: A Deep Multimodal Sensor Fusion Framework for Manipulation Failure Detection

arXiv:2011.05817v243 citationsHas Code
AI Analysis

This addresses the challenge of safe manipulation in unstructured environments for service robots, but it is incremental as it builds on existing sensor fusion methods.

The paper tackles the problem of detecting manipulation failures in service robots by proposing FINO-Net, a deep multimodal sensor fusion framework, achieving 98.60% detection and 87.31% classification accuracy on a new real-world dataset.

Safe manipulation in unstructured environments for service robots is a challenging problem. A failure detection system is needed to monitor and detect unintended outcomes. We propose FINO-Net, a novel multimodal sensor fusion based deep neural network to detect and identify manipulation failures. We also introduce a multimodal dataset, containing 229 real-world manipulation data recorded with a Baxter robot. Our network combines RGB, depth and audio readings to effectively detect and classify failures. Results indicate that fusing RGB with depth and audio modalities significantly improves the performance. FINO-Net achieves 98.60% detection and 87.31% classification accuracy on our novel dataset. Code and data are publicly available at https://github.com/ardai/fino-net.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes