LGAIAOFeb 17, 2021

Mode-Assisted Joint Training of Deep Boltzmann Machines

arXiv:2102.08562v1
Originality Highly original
AI Analysis

This addresses the problem of inefficient training and high parameter counts in DBMs for researchers and practitioners in machine learning, offering a more compact and hardware-friendly solution.

The paper tackles the challenging unsupervised joint training of deep Boltzmann machines (DBMs) by applying a mode-assisted algorithm, resulting in DBMs that can represent datasets with orders of magnitude fewer parameters compared to state-of-the-art methods and RBMs.

The deep extension of the restricted Boltzmann machine (RBM), known as the deep Boltzmann machine (DBM), is an expressive family of machine learning models which can serve as compact representations of complex probability distributions. However, jointly training DBMs in the unsupervised setting has proven to be a formidable task. A recent technique we have proposed, called mode-assisted training, has shown great success in improving the unsupervised training of RBMs. Here, we show that the performance gains of the mode-assisted training are even more dramatic for DBMs. In fact, DBMs jointly trained with the mode-assisted algorithm can represent the same data set with orders of magnitude lower number of total parameters compared to state-of-the-art training procedures and even with respect to RBMs, provided a fan-in network topology is also introduced. This substantial saving in number of parameters makes this training method very appealing also for hardware implementations.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes