LGCVMLMay 16, 2019

Joint Learning of Neural Networks via Iterative Reweighted Least Squares

arXiv:1905.06526v24 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of leveraging correlated information across diverse datasets for neural network learning, offering a method that is incremental but provides specific gains in multi-dataset scenarios.

The paper tackles the problem of jointly learning neural networks across multiple datasets to improve performance by sharing weights and automatically determining layer sharing, achieving better results than isolated learning and pretraining-finetuning baselines on image classification, auto-encoders, and image generation tasks.

In this paper, we introduce the problem of jointly learning feed-forward neural networks across a set of relevant but diverse datasets. Compared to learning a separate network from each dataset in isolation, joint learning enables us to extract correlated information across multiple datasets to significantly improve the quality of learned networks. We formulate this problem as joint learning of multiple copies of the same network architecture and enforce the network weights to be shared across these networks. Instead of hand-encoding the shared network layers, we solve an optimization problem to automatically determine how layers should be shared between each pair of datasets. Experimental results show that our approach outperforms baselines without joint learning and those using pretraining-and-fine-tuning. We show the effectiveness of our approach on three tasks: image classification, learning auto-encoders, and image generation.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes