CVAug 22, 2024
Towards Optimal Aggregation of Varying Range Dependencies in Haze RemovalXiaozhe Zhang, Fengying Xie, Haidong Ding et al.
Haze removal aims to restore a clear image from a hazy input. Existing methods achieve notable success by specializing in either short-range dependencies to preserve local details or long-range dependencies to capture global context. Given the complementary strengths of both, a natural progression is to explicitly integrate them within a unified framework and enable their reasonable aggregation. However, this integration remains underexplored. In this paper, we propose DehazeMatic, which simultaneously and explicitly captures both short- and long-range dependencies through a dual-stream design. To optimize the contribution of dependencies at varying ranges, we conduct extensive experiments to identify key influencing factors and find that an effective aggregation mechanism should be guided by the joint consideration of haze density and semantic information. Building on these insights, we introduce the CLIP-enhanced Dual-path Aggregator, which not only enables the generation of fine-grained haze density maps for the first time, but also produces semantic maps within a shared backbone, ultimately leveraging both to instruct the aggregation process. Extensive experiments demonstrate that DehazeMatic outperforms state-of-the-art methods across multiple benchmarks.
CVApr 20, 2021
A novel three-stage training strategy for long-tailed classificationGongzhe Li, Zhiwen Tan, Linpeng Pan
The long-tailed distribution datasets poses great challenges for deep learning based classification models on how to handle the class imbalance problem. Existing solutions usually involve class-balacing strategies or transfer learing from head- to tail-classes or use two-stages learning strategy to re-train the classifier. However, the existing methods are difficult to solve the low quality problem when images are obtained by SAR. To address this problem, we establish a novel three-stages training strategy, which has excellent results for processing SAR image datasets with long-tailed distribution. Specifically, we divide training procedure into three stages. The first stage is to use all kinds of images for rough-training, so as to get the rough-training model with rich content. The second stage is to make the rough model learn the feature expression by using the residual dataset with the class 0 removed. The third stage is to fine tune the model using class-balanced datasets with all 10 classes (including the overall model fine tuning and classifier re-optimization). Through this new training strategy, we only use the information of SAR image dataset and the network model with very small parameters to achieve the top 1 accuracy of 22.34 in development phase.