ROCVMar 19, 2025

DRoPE: Directional Rotary Position Embedding for Efficient Agent Interaction Modeling

arXiv:2503.15029v110 citationsh-index: 14
Originality Incremental advance
AI Analysis

This addresses the challenge of balancing accuracy, computational time, and memory efficiency in autonomous driving systems, representing an incremental improvement over existing methods.

The paper tackled the problem of accurately and efficiently modeling agent interactions for trajectory generation in autonomous driving by proposing Directional Rotary Position Embedding (DRoPE), which improved accuracy while significantly reducing space complexity compared to state-of-the-art models.

Accurate and efficient modeling of agent interactions is essential for trajectory generation, the core of autonomous driving systems. Existing methods, scene-centric, agent-centric, and query-centric frameworks, each present distinct advantages and drawbacks, creating an impossible triangle among accuracy, computational time, and memory efficiency. To break this limitation, we propose Directional Rotary Position Embedding (DRoPE), a novel adaptation of Rotary Position Embedding (RoPE), originally developed in natural language processing. Unlike traditional relative position embedding (RPE), which introduces significant space complexity, RoPE efficiently encodes relative positions without explicitly increasing complexity but faces inherent limitations in handling angular information due to periodicity. DRoPE overcomes this limitation by introducing a uniform identity scalar into RoPE's 2D rotary transformation, aligning rotation angles with realistic agent headings to naturally encode relative angular information. We theoretically analyze DRoPE's correctness and efficiency, demonstrating its capability to simultaneously optimize trajectory generation accuracy, time complexity, and space complexity. Empirical evaluations compared with various state-of-the-art trajectory generation models, confirm DRoPE's good performance and significantly reduced space complexity, indicating both theoretical soundness and practical effectiveness. The video documentation is available at https://drope-traj.github.io/.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes