ROAICVLGApr 26, 2022

Coarse-to-fine Q-attention with Tree Expansion

arXiv:2204.12471v211 citationsh-index: 164
Originality Incremental advance
AI Analysis

This addresses sample efficiency and ambiguity issues in robot manipulation, particularly for tasks with small or similar objects, but is incremental as it builds on existing Q-attention methods.

The paper tackled the problem of 'coarse ambiguity' in coarse-to-fine Q-attention for robot manipulation, which hinders distinguishing similar objects at coarse resolutions, and proposed Q-attention with Tree Expansion (QTE) to accumulate value estimates across voxels, resulting in improved performance on 12 RLBench tasks and a real-world task with small objects.

Coarse-to-fine Q-attention enables sample-efficient robot manipulation by discretizing the translation space in a coarse-to-fine manner, where the resolution gradually increases at each layer in the hierarchy. Although effective, Q-attention suffers from "coarse ambiguity" - when voxelization is significantly coarse, it is not feasible to distinguish similar-looking objects without first inspecting at a finer resolution. To combat this, we propose to envision Q-attention as a tree that can be expanded and used to accumulate value estimates across the top-k voxels at each Q-attention depth. When our extension, Q-attention with Tree Expansion (QTE), replaces standard Q-attention in the Attention-driven Robot Manipulation (ARM) system, we are able to accomplish a larger set of tasks; especially on those that suffer from "coarse ambiguity". In addition to evaluating our approach across 12 RLBench tasks, we also show that the improved performance is visible in a real-world task involving small objects.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes