LGMay 24, 2023

IBCL: Zero-shot Model Generation under Stability-Plasticity Trade-offs

Pengyuan Lu, Michele Caprio, Eric Eaton, Insup Lee

arXiv:2305.14782v410.76 citationsHas Code

Originality Incremental advance

AI Analysis

This addresses the inefficiency of retraining models for different trade-off preferences in continual learning, offering a zero-shot solution that is incremental in method but impactful for practical applications.

The paper tackles the problem of generating models for specified stability-plasticity trade-offs in continual learning without retraining, proposing IBCL, which improves classification accuracy by up to 44% on average per task and reduces training overhead to constant time.

Algorithms that balance the stability-plasticity trade-off are well studied in the Continual Learning literature. However, only a few focus on obtaining models for specified trade-off preferences. When solving the problem of continual learning under specific trade-offs (CLuST), state-of-the-art techniques leverage rehearsal-based learning, which requires retraining when a model corresponding to a new trade-off preference is requested. This is inefficient, since there potentially exists a significant number of different trade-offs, and a large number of models may be requested. As a response, we propose Imprecise Bayesian Continual Learning (IBCL), an algorithm that tackles CLuST efficiently. IBCL replaces retraining with a constant-time convex combination. Given a new task, IBCL (1) updates the knowledge base as a convex hull of model parameter distributions, and (2) generates one Pareto-optimal model per given trade-off via convex combination without additional training. That is, obtaining models corresponding to specified trade-offs via IBCL is zero-shot. Experiments whose baselines are current CLuST algorithms show that IBCL improves classification by at most 44% on average per task accuracy, and by 45% on peak per task accuracy while maintaining a near-zero to positive backward transfer, with memory overheads converging to constants. In addition, its training overhead, measured by the number of batch updates, remains constant at every task, regardless of the number of preferences requested. IBCL also improves multi-objective reinforcement learning tasks by maintaining the same Pareto front hypervolume, while significantly reducing the training cost. Details can be found at: https://github.com/ibcl-anon/ibcl.

View on arXiv PDF Code

Similar