CVCLLGOct 24, 2023

TiC-CLIP: Continual Training of CLIP Models

U of Toronto
arXiv:2310.16226v345 citationsh-index: 47Has Code
Originality Incremental advance
AI Analysis

This addresses the challenge of keeping foundation models current for AI practitioners, though it is incremental as it builds on existing CLIP and continual learning methods.

The paper tackled the problem of efficiently updating large vision-language models like CLIP with new data without costly retraining, by introducing web-scale continual learning benchmarks and showing that a rehearsal-based approach reduces compute by 2.5x compared to retraining from scratch.

Keeping large foundation models up to date on latest data is inherently expensive. To avoid the prohibitive costs of constantly retraining, it is imperative to continually train these models. This problem is exacerbated by the lack of any large scale continual learning benchmarks or baselines. We introduce the first set of web-scale Time-Continual (TiC) benchmarks for training vision-language models: TiC-DataComp, TiC-YFCC, and TiC-Redcaps. TiC-DataComp, our largest dataset, contains over 12.7B timestamped image-text pairs spanning 9 years (2014-2022). We first use our benchmarks to curate various dynamic evaluations to measure temporal robustness of existing models. We show OpenAI's CLIP (trained on data up to 2020) loses $\approx 8\%$ zero-shot accuracy on our curated retrieval task from 2021-2022 compared with more recently trained models in OpenCLIP repository. We then study how to efficiently train models on time-continuous data. We demonstrate that a simple rehearsal-based approach that continues training from the last checkpoint and replays old data reduces compute by $2.5\times$ when compared to the standard practice of retraining from scratch. Code is available at https://github.com/apple/ml-tic-clip.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes