CVSep 9, 2024

Open-World Dynamic Prompt and Continual Visual Representation Learning

Amazon
arXiv:2409.05312v25 citationsh-index: 19
Originality Incremental advance
AI Analysis

This addresses the problem of generalizing to unseen classes in continual learning for visual tasks, though it appears incremental as it builds on prior prompt-based methods.

The paper tackles the challenge of continual learning in dynamic open-world environments by introducing a new setting and a method called DPaRL, which learns dynamic prompts and representations, achieving an average 4.7% improvement in Recall@1 on image retrieval benchmarks.

The open world is inherently dynamic, characterized by ever-evolving concepts and distributions. Continual learning (CL) in this dynamic open-world environment presents a significant challenge in effectively generalizing to unseen test-time classes. To address this challenge, we introduce a new practical CL setting tailored for open-world visual representation learning. In this setting, subsequent data streams systematically introduce novel classes that are disjoint from those seen in previous training phases, while also remaining distinct from the unseen test classes. In response, we present Dynamic Prompt and Representation Learner (DPaRL), a simple yet effective Prompt-based CL (PCL) method. Our DPaRL learns to generate dynamic prompts for inference, as opposed to relying on a static prompt pool in previous PCL methods. In addition, DPaRL jointly learns dynamic prompt generation and discriminative representation at each training stage whereas prior PCL methods only refine the prompt learning throughout the process. Our experimental results demonstrate the superiority of our approach, surpassing state-of-the-art methods on well-established open-world image retrieval benchmarks by an average of 4.7% improvement in Recall@1 performance.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes