LGDIS-NNOct 21, 2024

Modeling Structured Data Learning with Restricted Boltzmann Machines in the Teacher-Student Setting

arXiv:2410.16150v25 citationsh-index: 4Neural Networks
Originality Synthesis-oriented
AI Analysis

This work provides incremental insights into structured data learning in RBMs, relevant for researchers in generative models and theoretical machine learning.

The study investigates how a student Restricted Boltzmann Machine learns structured data from a teacher RBM, finding that the critical data needed decreases with more patterns and correlations, but learning fails if inference temperature is too low, even with large datasets.

Restricted Boltzmann machines (RBM) are generative models capable to learn data with a rich underlying structure. We study the teacher-student setting where a student RBM learns structured data generated by a teacher RBM. The amount of structure in the data is controlled by adjusting the number of hidden units of the teacher and the correlations in the rows of the weights, a.k.a. patterns. In the absence of correlations, we validate the conjecture that the performance is independent of the number of teacher patters and hidden units of the student RBMs, and we argue that the teacher-student setting can be used as a toy model for studying the lottery ticket hypothesis. Beyond this regime, we find that the critical amount of data required to learn the teacher patterns decreases with both their number and correlations. In both regimes, we find that, even with a relatively large dataset, it becomes impossible to learn the teacher patterns if the inference temperature used for regularization is kept too low. In our framework, the student can learn teacher patterns one-to-one or many-to-one, generalizing previous findings about the teacher-student setting with two hidden units to any arbitrary finite number of hidden units.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes