RO AI CV LGApr 26, 2022

Coarse-to-fine Q-attention with Tree Expansion

arXiv:2204.12471v213.511 citationsh-index: 164

Originality Incremental advance

AI Analysis

This addresses sample efficiency and ambiguity issues in robot manipulation, particularly for tasks with small or similar objects, but is incremental as it builds on existing Q-attention methods.

The paper tackled the problem of 'coarse ambiguity' in coarse-to-fine Q-attention for robot manipulation, which hinders distinguishing similar objects at coarse resolutions, and proposed Q-attention with Tree Expansion (QTE) to accumulate value estimates across voxels, resulting in improved performance on 12 RLBench tasks and a real-world task with small objects.

Coarse-to-fine Q-attention enables sample-efficient robot manipulation by discretizing the translation space in a coarse-to-fine manner, where the resolution gradually increases at each layer in the hierarchy. Although effective, Q-attention suffers from "coarse ambiguity" - when voxelization is significantly coarse, it is not feasible to distinguish similar-looking objects without first inspecting at a finer resolution. To combat this, we propose to envision Q-attention as a tree that can be expanded and used to accumulate value estimates across the top-k voxels at each Q-attention depth. When our extension, Q-attention with Tree Expansion (QTE), replaces standard Q-attention in the Attention-driven Robot Manipulation (ARM) system, we are able to accomplish a larger set of tasks; especially on those that suffer from "coarse ambiguity". In addition to evaluating our approach across 12 RLBench tasks, we also show that the improved performance is visible in a real-world task involving small objects.

View on arXiv PDF

Similar