CVLGJan 17, 2023

Vision Based Machine Learning Algorithms for Out-of-Distribution Generalisation

arXiv:2301.06975v16 citationsh-index: 66
Originality Synthesis-oriented
AI Analysis

This addresses the problem of poor generalization in computer vision models for researchers and practitioners, but it is incremental as it focuses on benchmarking existing methods.

The paper compared vision-based machine learning methods for domain-specific and domain-generalized tasks, finding that simple CNN-based models perform poorly on out-of-distribution datasets like PACS and Office-Home, with results showing decreased accuracy in domain-shifting scenarios.

There are many computer vision applications including object segmentation, classification, object detection, and reconstruction for which machine learning (ML) shows state-of-the-art performance. Nowadays, we can build ML tools for such applications with real-world accuracy. However, each tool works well within the domain in which it has been trained and developed. Often, when we train a model on a dataset in one specific domain and test on another unseen domain known as an out of distribution (OOD) dataset, models or ML tools show a decrease in performance. For instance, when we train a simple classifier on real-world images and apply that model on the same classes but with a different domain like cartoons, paintings or sketches then the performance of ML tools disappoints. This presents serious challenges of domain generalisation (DG), domain adaptation (DA), and domain shifting. To enhance the power of ML tools, we can rebuild and retrain models from scratch or we can perform transfer learning. In this paper, we present a comparison study between vision-based technologies for domain-specific and domain-generalised methods. In this research we highlight that simple convolutional neural network (CNN) based deep learning methods perform poorly when they have to tackle domain shifting. Experiments are conducted on two popular vision-based benchmarks, PACS and Office-Home. We introduce an implementation pipeline for domain generalisation methods and conventional deep learning models. The outcome confirms that CNN-based deep learning models show poor generalisation compare to other extensive methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes