CVMay 19Code
SDM: A Powerful Tool for Evaluating Model RobustnessXinlei Liu, Tao Hu, Jichao Xie et al.
Gradient-based attacks are important methods for evaluating model robustness. However, since the proposal of APGD, it has been difficult for such methods to achieve significant breakthroughs. To achieve such an effect, we first analyze the issue of "high-loss non-adversarial examples" that degrades attack performance in previous methods, and prove that this issue arises from inappropriate objectives for adversarial example generation. Subsequently, we reconstruct the objective as "maximizing the difference between the non-ground-truth label probability upper bound and the ground-truth label probability", and proposes a novel and powerful gradient-based attack method named Sequential Difference Maximization (SDM). SDM establishes a three-layer optimization framework of "cycle-stage-step". It adopts the negative probability loss function and the Directional Probability Difference Ratio (DPDR) loss function in the initial and subsequent optimization stages, respectively, and approaches the ideal objective of adversarial example generation via stage-wise sequential optimization. Experiments demonstrate that compared with previous state-of-the-art methods, SDM not only achieves stronger attack performance but also exhibits superior cost-effectiveness. The code is available at https://github.com/X-L-Liu/ICML-SDM.
ROMar 16
Emergent Dexterity via Diverse Resets and Large-Scale Reinforcement LearningPatrick Yin, Tyler Westenbroek, Zhengyu Zhang et al.
Reinforcement learning in massively parallel physics simulations has driven major progress in sim-to-real robot learning. However, current approaches remain brittle and task-specific, relying on extensive per-task engineering to design rewards, curricula, and demonstrations. Even with this engineering, they often fail on long-horizon, contact-rich manipulation tasks and do not meaningfully scale with compute, as performance quickly saturates when training revisits the same narrow regions of state space. We introduce \Method, a simple and scalable framework that enables on-policy reinforcement learning to robustly solve a broad class of dexterous manipulation tasks using a single reward function, fixed algorithm hyperparameters, no curricula, and no human demonstrations. Our key insight is that long-horizon exploration can be dramatically simplified by using simulator resets to systematically expose the RL algorithm to the diverse set of robot-object interactions which underlie dexterous manipulation. \Method\ programmatically generates such resets with minimal human input, converting additional compute directly into broader behavioral coverage and continued performance gains. We show that \Method\ gracefully scales to long-horizon dexterous manipulation tasks beyond the capabilities of existing approaches and is able to learn robust policies over significantly wider ranges of initial conditions than baselines. Finally, we distill \Method \ into visuomotor policies which display robust retrying behavior and substantially higher success rates than baselines when transferred to the real world zero-shot. Project webpage: https://omnireset.github.io
CVAug 31, 2025Code
Sequential Difference Maximization: Generating Adversarial Examples via Multi-Stage OptimizationXinlei Liu, Tao Hu, Peng Yi et al.
Efficient adversarial attack methods are critical for assessing the robustness of computer vision models. In this paper, we reconstruct the optimization objective for generating adversarial examples as "maximizing the difference between the non-true labels' probability upper bound and the true label's probability," and propose a gradient-based attack method termed Sequential Difference Maximization (SDM). SDM establishes a three-layer optimization framework of "cycle-stage-step." The processes between cycles and between iterative steps are respectively identical, while optimization stages differ in terms of loss functions: in the initial stage, the negative probability of the true label is used as the loss function to compress the solution space; in subsequent stages, we introduce the Directional Probability Difference Ratio (DPDR) loss function to gradually increase the non-true labels' probability upper bound by compressing the irrelevant labels' probabilities. Experiments demonstrate that compared with previous SOTA methods, SDM not only exhibits stronger attack performance but also achieves higher attack cost-effectiveness. Additionally, SDM can be combined with adversarial training methods to enhance their defensive effects. The code is available at https://github.com/X-L-Liu/SDM.
NIMar 24
RF-Zero-Wire: Design and Analysis of Multi-Hop Low-latency Symbol-synchronous RF CommunicationXinlei Liu, Andrey Belogaev, Jonathan Oostvogels et al.
The latency gap between wired and wireless networks poses a challenge in the adoption of wireless technologies in latency-sensitive scenarios. The gap is especially notable in multi-hop communication typical for industrial sensor networks and robotic swarms. The main reason behind it is that commonly used wireless protocols rely on store-and-forward routing and costly overhead procedures to avoid interference. This article introduces RF-Zero-Wire, an RF-based symbol-synchronous communication protocol. Instead of relaying the whole frame per hop in a store-and-forward manner, nodes concurrently relay the frame symbol by symbol, without the need for tight time synchronization. Based on data collected in real-world experiments, we reveal that the inevitable carrier frequency offsets (CFOs) introduced by imperfect crystal oscillators cause a beating effect under concurrent symbol transmissions. This is characterized by periodic constructive and destructive interference, which significantly affects reliability. Subsequently, a thorough simulation study shows how the beating problem can be overcome with error correction codes. RF-Zero-Wire allows achieving an end-to-end latency of less than 1ms for a small 4-byte frame transmitted across 5 hops. Moreover, latency is shown to increase only by 0.16% per extra hop for 16-byte frames, which is negligible compared to the over 100% per-hop latency increase observed in store-and-forward protocols. The trade-offs between network reliability and CFO range, communication distance, node density, and achievable data rate are studied in large-scale experiments based on simulation.
NIMar 24
Symbol-Synchronous Communication for Ultra-Low-Power Multi-Hop Ambient IoT NetworksXinlei Liu, Andrey Belogaev, Jeroen Famaey
Ambient Internet of Things (A-IoT) devices, as a critical enabler of future green IoT networks, have attracted broad interest from both industry and academia due to their ability to operate without batteries and with low maintenance costs. To accommodate their dynamic and constrained energy budget, an ultra-low-power connectivity protocol is required. Due to the severely limited transmit power of A-IoT devices, multi-hop connectivity is an interesting paradigm to extend their range. However, commonly used protocols for multi-hop communication may not be suitable for A-IoT due to excessive overhead related to channel access procedures, coordinated routing, and tight time synchronization requirements. This paper presents a novel network connectivity protocol based on symbol-synchronous transmissions, which allows battery-less relay nodes to participate in the forwarding process in an ad-hoc manner, without the need for synchronization or coordination. This allows them to adapt their duty cycle to the available harvested energy. Simulation results show that the proposed protocol can ensure high reliability in data packet delivery while significantly reducing the energy consumption of each relay node. We also investigate the relationship between wake-up probability and network density. For example, a 400-node network in a 625 m2 area can achieve a packet error rate below 1 % with an average awake time of 6 % per node, achieving an energy consumption reduction of 88 % compared to the baseline approach.