52.1ROApr 7Code
RoboBPP: Benchmarking Robotic Online Bin Packing with Physics-based SimulationZhoufeng Wang, Hang Zhao, Juzhan Xu et al.
Physical feasibility in 3D bin packing is a key requirement in modern industrial logistics and robotic automation. With the growing adoption of industrial automation, online bin packing has gained increasing attention. However, inconsistencies in problem settings, test datasets, and evaluation metrics have hindered progress in the field, and there is a lack of a comprehensive benchmarking system. Direct testing on real hardware is costly, and building a realistic simulation environment is also challenging. To address these limitations, we introduce RoboBPP, a benchmarking system designed for robotic online bin packing. RoboBPP integrates a physics-based simulator to assess physical feasibility. In our simulation environment, we introduce a robotic arm and boxes at real-world scales to replicate real industrial packing workflows. By simulating conditions that arise in real industrial applications, we ensure that evaluated algorithms are practically deployable. In addition, prior studies often rely on synthetic datasets whose distributions differ from real-world industrial data. To address this issue, we collect three datasets from real industrial workflows, including assembly-line production, logistics packing, and furniture manufacturing. The benchmark comprises three carefully designed test settings and extends existing evaluation metrics with new metrics for structural stability and operational safety. We design a scoring system and derive a range of insights from the evaluation results. RoboBPP is fully open-source and is equipped with visualization tools and an online leaderboard, providing a reproducible and extensible foundation for future research and industrial applications (https://robot-bin-packing-benchmark.github.io).
NANov 29, 2018
An Adaptive Finite Element DtN Method for Maxwell's Equations in Biperiodic StructuresXue Jiang, Peijun Li, Junliang Lv et al.
Consider the diffraction of an electromagnetic plane wave by a biperiodic structure where the wave propagation is governed by the three-dimensional Maxwell equations. Based on transparent boundary condition, the grating problem is formulated into a boundary value problem in a bounded domain. Using a duality argument technique, we derive an a posteriori error estimate for the finite element method with the truncation of the nonlocal Dirichlet-to-Neumann (DtN) boundary operator. The a posteriori error consists of both the finite element approximation error and the truncation error of boundary operator which decays exponentially with respect to the truncation parameter. An adaptive finite element algorithm is developed with error controlled by the a posterior error estimate, which determines the truncation parameter through the truncation error and adjusts the mesh through the finite element approximation error. Numerical experiments are presented to demonstrate the competitive behavior of the proposed adaptive method.
CVJul 23, 2025
RemixFusion: Residual-based Mixed Representation for Large-scale Online RGB-D ReconstructionYuqing Lan, Chenyang Zhu, Shuaifeng Zhi et al.
The introduction of the neural implicit representation has notably propelled the advancement of online dense reconstruction techniques. Compared to traditional explicit representations, such as TSDF, it improves the mapping completeness and memory efficiency. However, the lack of reconstruction details and the time-consuming learning of neural representations hinder the widespread application of neural-based methods to large-scale online reconstruction. We introduce RemixFusion, a novel residual-based mixed representation for scene reconstruction and camera pose estimation dedicated to high-quality and large-scale online RGB-D reconstruction. In particular, we propose a residual-based map representation comprised of an explicit coarse TSDF grid and an implicit neural module that produces residuals representing fine-grained details to be added to the coarse grid. Such mixed representation allows for detail-rich reconstruction with bounded time and memory budget, contrasting with the overly-smoothed results by the purely implicit representations, thus paving the way for high-quality camera tracking. Furthermore, we extend the residual-based representation to handle multi-frame joint pose optimization via bundle adjustment (BA). In contrast to the existing methods, which optimize poses directly, we opt to optimize pose changes. Combined with a novel technique for adaptive gradient amplification, our method attains better optimization convergence and global optimality. Furthermore, we adopt a local moving volume to factorize the mixed scene representation with a divide-and-conquer design to facilitate efficient online learning in our residual-based framework. Extensive experiments demonstrate that our method surpasses all state-of-the-art ones, including those based either on explicit or implicit representations, in terms of the accuracy of both mapping and tracking on large-scale scenes.