RO AIJan 23

Sim-to-Real Transfer via a Style-Identified Cycle Consistent Generative Adversarial Network: Zero-Shot Deployment on Robotic Manipulators through Visual Domain Adaptation

Lucía Güitta-López, Lionel Güitta-López, Jaime Boal, Álvaro Jesús López-López

arXiv:2601.16677v12.21 citationsh-index: 1

Originality Incremental advance

AI Analysis

This addresses the sample efficiency problem for industrial robotics by enabling cost-effective virtual training with direct real-world deployment, though it is incremental as it builds on existing CycleGAN-based domain adaptation.

This paper tackles the sim-to-real gap in Deep Reinforcement Learning for robotic manipulation by proposing a Style-Identified Cycle Consistent Generative Adversarial Network (SICGAN) to translate virtual observations into real-like images, enabling zero-shot transfer. The method achieved success rates of 90-100% in simulation and above 95% accuracy in real-world deployment for pick-and-place tasks without additional training.

The sample efficiency challenge in Deep Reinforcement Learning (DRL) compromises its industrial adoption due to the high cost and time demands of real-world training. Virtual environments offer a cost-effective alternative for training DRL agents, but the transfer of learned policies to real setups is hindered by the sim-to-real gap. Achieving zero-shot transfer, where agents perform directly in real environments without additional tuning, is particularly desirable for its efficiency and practical value. This work proposes a novel domain adaptation approach relying on a Style-Identified Cycle Consistent Generative Adversarial Network (StyleID-CycleGAN or SICGAN), an original Cycle Consistent Generative Adversarial Network (CycleGAN) based model. SICGAN translates raw virtual observations into real-synthetic images, creating a hybrid domain for training DRL agents that combines virtual dynamics with real-like visual inputs. Following virtual training, the agent can be directly deployed, bypassing the need for real-world training. The pipeline is validated with two distinct industrial robots in the approaching phase of a pick-and-place operation. In virtual environments agents achieve success rates of 90 to 100\%, and real-world deployment confirms robust zero-shot transfer (i.e., without additional training in the physical environment) with accuracies above 95\% for most workspace regions. We use augmented reality targets to improve the evaluation process efficiency, and experimentally demonstrate that the agent successfully generalizes to real objects of varying colors and shapes, including LEGO\textsuperscript{\textregistered}~cubes and a mug. These results establish the proposed pipeline as an efficient, scalable solution to the sim-to-real problem.

View on arXiv PDF

Similar