ROJul 27, 2021

End-To-End Real-Time Visual Perception Framework for Construction Automation

arXiv:2107.12701v1
Originality Incremental advance
AI Analysis

This addresses the problem of labor-intensive construction tasks for robotics and automation, but it is incremental as it builds on existing detection methods.

The paper tackles automating wall construction by developing an end-to-end visual perception framework that detects and localizes bricks in clutter, using a rotated bounding box method that outperforms upright detectors with reported precision, recall, and mAP scores.

In this work, we present a robotic solution to automate the task of wall construction. To that end, we present an end-to-end visual perception framework that can quickly detect and localize bricks in a clutter. Further, we present a light computational method of brick pose estimation that incorporates the above information. The proposed detection network predicts a rotated box compared to YOLO and SSD, thereby maximizing the object's region in the predicted box regions. In addition, precision P, recall R, and mean-average-precision (mAP) scores are reported to evaluate the proposed framework. We observed that for our task, the proposed scheme outperforms the upright bounding box detectors. Further, we deploy the proposed visual perception framework on a robotic system endowed with a UR5 robot manipulator and demonstrate that the system can successfully replicate a simplified version of the wall-building task in an autonomous mode.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes