A Toolkit for Virtual Reality Data Collection
This toolkit addresses the problem of limited VR datasets for researchers in deep-learning, psychological modeling, and data analysis, though it is incremental as it builds on existing data collection methods.
The authors tackled the challenge of acquiring large-scale virtual reality datasets by developing a versatile data collection toolkit that integrates with any device and includes a robust pipeline emphasizing ethics and reproducibility, resulting in a tool designed to facilitate extensive VR data capture.
Due to the still relatively low number of users, acquiring large-scale and multidimensional virtual reality datasets remains a significant challenge. Consequently, VR datasets comparable in size to state-of-the-art collections in natural language processing or computer vision are rare or absent. However, the availability of such datasets could unlock groundbreaking advancements in deep-learning, psychological modeling, and data analysis in the context of VR. In this paper, we present a versatile data collection toolkit designed to facilitate the capturing of extensive VR datasets. Our toolkit seamlessly integrates with any device, either directly via OpenXR or through the use of a virtual device. Additionally, we introduce a robust data collection pipeline that emphasizes ethical practices (e.g., ensuring data protection and regulation) and ensures a standardized, reproducible methodology.