Xuanhui Xu

ROJul 20, 2023

Goal-Conditioned Reinforcement Learning with Disentanglement-based Reachability Planning

Zhifeng Qian, Mingyu You, Hongjun Zhou et al.

Goal-Conditioned Reinforcement Learning (GCRL) can enable agents to spontaneously set diverse goals to learn a set of skills. Despite the excellent works proposed in various fields, reaching distant goals in temporally extended tasks remains a challenge for GCRL. Current works tackled this problem by leveraging planning algorithms to plan intermediate subgoals to augment GCRL. Their methods need two crucial requirements: (i) a state representation space to search valid subgoals, and (ii) a distance function to measure the reachability of subgoals. However, they struggle to scale to high-dimensional state space due to their non-compact representations. Moreover, they cannot collect high-quality training data through standard GC policies, which results in an inaccurate distance function. Both affect the efficiency and performance of planning and policy learning. In the paper, we propose a goal-conditioned RL algorithm combined with Disentanglement-based Reachability Planning (REPlan) to solve temporally extended tasks. In REPlan, a Disentangled Representation Module (DRM) is proposed to learn compact representations which disentangle robot poses and object positions from high-dimensional observations in a self-supervised manner. A simple REachability discrimination Module (REM) is also designed to determine the temporal distance of subgoals. Moreover, REM computes intrinsic bonuses to encourage the collection of novel states for training. We evaluate our REPlan in three vision-based simulation tasks and one real-world task. The experiments demonstrate that our REPlan significantly outperforms the prior state-of-the-art methods in solving temporally extended tasks.

HCMay 18, 2021

3D Displays: Their Evolution, Inherent Challenges & Future Perspectives

Xingyu Pan, Xuanhui Xu, Soumyabrata Dev et al.

The popularity of 3D displays has risen drastically over the past few decades but these displays are still merely a novelty compared to their true potential. The development has mostly focused on Head Mounted Displays (HMD) development for Virtual Reality and in general ignored non-HMD 3D displays. This is due to the inherent difficulty in the creation of these displays and their impracticability in general use due to cost, performance, and lack of meaningful use cases. In fairness to the hardware manufacturers who have made striking innovations in this field, there has been a dereliction of duty of software developers and researchers in terms of developing software to best utilize these displays. This paper will seek to identify what areas of future software development could mitigate this dereliction. To achieve this goal, the paper will first examine the current state of the art and perform a comparative analysis on different types of 3D displays, from this analysis a clear researcher gap exists in terms of software development for Light field displays which are the current state of the art of non-HMD-based 3D displays. The paper will then outline six distinct areas where the context-awareness concept will allow for non-HMD-based 3D displays in particular light field displays that can not only compete but surpass their HMD-based brethren for many specific use cases.

Xuanhui Xu

2 Papers