Smoother Action Chunking Flow Policy via Prior-Corrected Orthogonal Trust-Region Guidance
For robot learning practitioners using flow-matching policies, POTR offers a simple fix to chunk-boundary artifacts without changing the base policy.
POTR improves action-chunking flow policies by reducing discontinuous transitions at chunk boundaries, achieving higher success rates and lower acceleration/jerk on LIBERO with π0.5 compared to prior RTC guidance.
Flow-matching robot policies commonly use action-chunking inference for efficient closed-loop control, but chunk boundaries can introduce discontinuous action transitions. Existing RTC guidance improves continuity by injecting correction signals during denoising, yet its weight schedule is weak at intermediate timesteps and its unconstrained correction direction may introduce transverse perturbations. We propose POTR, a **p**rior-corrected **o**rthogonal **t**rust-**r**egion guidance method. First, we incorporate a data-prior scale $σ_d$ into the RTC guidance weight, yielding stronger intermediate-time correction. Second, we decompose the guidance vector into components parallel and perpendicular to the denoising velocity, and constrain the perpendicular component within a trust region. On LIBERO with $π_{0.5}$, POTR improves success rate and consistently reduces chunk-boundary discontinuity, acceleration, and jerk compared with RTC. Ablations show that the prior-corrected weight provides the main correction gain, while the orthogonal trust region further improves stability.