CV AI LGJul 4, 2022

Factorizing Knowledge in Neural Networks

arXiv:2207.03337v225.4136 citationsh-index: 67Has Code

Originality Incremental advance

AI Analysis

This work addresses the need for more interpretable and modular AI systems, offering a novel approach to knowledge factorization that could benefit researchers and practitioners in machine learning, though it appears incremental in building on existing transfer learning concepts.

The paper tackles the problem of modularizing knowledge in pretrained neural networks by decomposing them into factor networks that handle dedicated tasks, enabling plug-and-play assembly without fine-tuning. The method, evaluated on various benchmarks, shows strong performance on dedicated tasks, disentanglement, interpretability, and transfer learning.

In this paper, we explore a novel and ambitious knowledge-transfer task, termed Knowledge Factorization~(KF). The core idea of KF lies in the modularization and assemblability of knowledge: given a pretrained network model as input, KF aims to decompose it into several factor networks, each of which handles only a dedicated task and maintains task-specific knowledge factorized from the source network. Such factor networks are task-wise disentangled and can be directly assembled, without any fine-tuning, to produce the more competent combined-task networks. In other words, the factor networks serve as Lego-brick-like building blocks, allowing us to construct customized networks in a plug-and-play manner. Specifically, each factor network comprises two modules, a common-knowledge module that is task-agnostic and shared by all factor networks, alongside with a task-specific module dedicated to the factor network itself. We introduce an information-theoretic objective, InfoMax-Bottleneck~(IMB), to carry out KF by optimizing the mutual information between the learned representations and input. Experiments across various benchmarks demonstrate that, the derived factor networks yield gratifying performances on not only the dedicated tasks but also disentanglement, while enjoying much better interpretability and modularity. Moreover, the learned common-knowledge representations give rise to impressive results on transfer learning. Our code is available at https://github.com/Adamdad/KnowledgeFactor.

View on arXiv PDF Code

Similar