BiFF: Bi-level Future Fusion with Polyline-based Coordinate for Interactive Trajectory Prediction
This addresses the safety-critical need for accurate multi-agent trajectory prediction in autonomous driving, representing an incremental improvement over existing methods.
The paper tackles the problem of predicting joint trajectories for interactive agents in autonomous driving by proposing Bi-level Future Fusion (BiFF) to capture future interactions, achieving state-of-the-art performance on the Waymo Open Motion Dataset benchmark.
Predicting future trajectories of surrounding agents is essential for safety-critical autonomous driving. Most existing work focuses on predicting marginal trajectories for each agent independently. However, it has rarely been explored in predicting joint trajectories for interactive agents. In this work, we propose Bi-level Future Fusion (BiFF) to explicitly capture future interactions between interactive agents. Concretely, BiFF fuses the high-level future intentions followed by low-level future behaviors. Then the polyline-based coordinate is specifically designed for multi-agent prediction to ensure data efficiency, frame robustness, and prediction accuracy. Experiments show that BiFF achieves state-of-the-art performance on the interactive prediction benchmark of Waymo Open Motion Dataset.