ROMar 9, 2018

Deep Visuo-Tactile Learning: Estimation of Tactile Properties from Images

arXiv:1803.03435v470 citations
AI Analysis

This enables robots to better understand their environment and adapt actions, though it is incremental as it builds on existing encoder-decoder methods.

The paper tackles the problem of estimating tactile properties like slipperiness or roughness from visual input alone, proposing a model that uses an encoder-decoder network with visual and tactile latent features, and shows generalization to unseen materials without manual labeling.

Estimation of tactile properties from vision, such as slipperiness or roughness, is important to effectively interact with the environment. These tactile properties help us decide which actions we should choose and how to perform them. E.g., we can drive slower if we see that we have bad traction or grasp tighter if an item looks slippery. We believe that this ability also helps robots to enhance their understanding of the environment, and thus enables them to tailor their actions to the situation at hand. We therefore propose a model to estimate the degree of tactile properties from visual perception alone (e.g., the level of slipperiness or roughness). Our method extends a encoder-decoder network, in which the latent variables are visual and tactile features. In contrast to previous works, our method does not require manual labeling, but only RGB images and the corresponding tactile sensor data. All our data is collected with a webcam and uSkin tactile sensor mounted on the end-effector of a Sawyer robot, which strokes the surfaces of 25 different materials. We show that our model generalizes to materials not included in the training data by evaluating the feature space, indicating that it has learned to associate important tactile properties with images.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes