VROOM - Visual Reconstruction over Onboard Multiview
This addresses the problem of scalable 4D reconstruction from onboard video for applications in motorsports or autonomous systems, but appears incremental as it combines existing methods.
The paper tackles 3D reconstruction of Formula 1 circuits using only onboard camera footage from racecars, showing that their VROOM system can partially recover track and vehicle trajectories in complex environments like the 2023 Monaco Grand Prix.
We introduce VROOM, a system for reconstructing 3D models of Formula 1 circuits using only onboard camera footage from racecars. Leveraging video data from the 2023 Monaco Grand Prix, we address video challenges such as high-speed motion and sharp cuts in camera frames. Our pipeline analyzes different methods such as DROID-SLAM, AnyCam, and Monst3r and combines preprocessing techniques such as different methods of masking, temporal chunking, and resolution scaling to account for dynamic motion and computational constraints. We show that Vroom is able to partially recover track and vehicle trajectories in complex environments. These findings indicate the feasibility of using onboard video for scalable 4D reconstruction in real-world settings. The project page can be found at https://varun-bharadwaj.github.io/vroom, and our code is available at https://github.com/yajatyadav/vroom.