ROCVMar 22, 2024

PseudoTouch: Efficiently Imaging the Surface Feel of Objects for Robotic Manipulation

arXiv:2403.15107v22 citationsh-index: 15ICRA
Originality Incremental advance
AI Analysis

This work addresses the problem of limited tactile sensing in robotics for tasks like object recognition and grasp stability, offering an incremental improvement through a novel visual-tactile embedding method.

The paper tackles the challenge of enabling robots to sense object surface feel for manipulation by linking high-dimensional structural information to low-dimensional tactile signals, achieving an 84% object recognition accuracy after ten touches and a 32% absolute improvement in grasp stability prediction accuracy.

Tactile sensing is vital for human dexterous manipulation, however, it has not been widely used in robotics. Compact, low-cost sensing platforms can facilitate a change, but unlike their popular optical counterparts, they are difficult to deploy in high-fidelity tasks due to their low signal dimensionality and lack of a simulation model. To overcome these challenges, we introduce PseudoTouch which links high-dimensional structural information to low-dimensional sensor signals. It does so by learning a low-dimensional visual-tactile embedding, wherein we encode a depth patch from which we decode the tactile signal. We collect and train PseudoTouch on a dataset comprising aligned tactile and visual data pairs obtained through random touching of eight basic geometric shapes. We demonstrate the utility of our trained PseudoTouch model in two downstream tasks: object recognition and grasp stability prediction. In the object recognition task, we evaluate the learned embedding's performance on a set of five basic geometric shapes and five household objects. Using PseudoTouch, we achieve an object recognition accuracy 84% after just ten touches, surpassing a proprioception baseline. For the grasp stability task, we use ACRONYM labels to train and evaluate a grasp success predictor using PseudoTouch's predictions derived from virtual depth information. Our approach yields a 32% absolute improvement in accuracy compared to the baseline relying on partial point cloud data. We make the data, code, and trained models publicly available at https://pseudotouch.cs.uni-freiburg.de.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes