CVMar 5

FOZO: Forward-Only Zeroth-Order Prompt Optimization for Test-Time Adaptation

arXiv:2603.04733v11 citations

Originality Highly original

AI Analysis

This work addresses the problem of efficient and stable model adaptation to distribution shifts for deep learning models, particularly for deployment on resource-limited devices.

This paper introduces Forward-Only Zeroth-Order Optimization (FOZO), a novel backpropagation-free method for Test-Time Adaptation (TTA) that optimizes prompts using zeroth-order methods. FOZO achieves 59.52% Top-1 accuracy on ImageNet-C (5K, level 5), outperforming state-of-the-art gradient-based methods and the forward-only FOA (58.13%).

Test-Time Adaptation (TTA) is essential for enabling deep learning models to handle real-world data distribution shifts. However, current approaches face significant limitations: backpropagation-based methods are not suitable for low-end deployment devices, due to their high computation and memory requirements, as well as their tendency to modify model weights during adaptation; while traditional backpropagation-free techniques exhibit constrained adaptation capabilities. In this work, we propose Forward-Only Zeroth-Order Optimization (FOZO), a novel and practical backpropagation-free paradigm for TTA. FOZO leverages a memory-efficient zeroth-order prompt optimization, which is led by objectives optimizing both intermediate feature statistics and prediction entropy. To ensure efficient and stable adaptation over the out-of-distribution data stream, we introduce a dynamically decaying perturbation scale during zeroth-order gradient estimation and theoretically prove its convergence under the TTA data stream assumption. Extensive continual adaptation experiments on ImageNet-C, ImageNet-R, and ImageNet-Sketch demonstrate FOZO's superior performance, achieving 59.52% Top-1 accuracy on ImageNet-C (5K, level 5) and outperforming main gradient-based methods and SOTA forward-only FOA (58.13%). Furthermore, FOZO exhibits strong generalization on quantized (INT8) models. These findings demonstrate that FOZO is a highly competitive solution for TTA deployment in resource-limited scenarios.

View on arXiv PDF

Similar