LG CL MLOct 18, 2023

Simple Mechanisms for Representing, Indexing and Manipulating Concepts

Yuanzhi Li, Raghu Meka, Rina Panigrahy, Kulin Shah

arXiv:2310.12143v22.01 citationsh-index: 33

Originality Highly original

AI Analysis

This provides a foundational approach for representing and indexing concepts in machine learning, which could enhance interpretability and hierarchical modeling.

The paper tackles the lack of a mathematical framework for defining and manipulating concepts in deep learning by proposing to represent concepts as zero sets of polynomials using moment statistics, enabling the discovery of hierarchical structures in data.

Supervised and unsupervised learning using deep neural networks typically aims to exploit the underlying structure in the training data; this structure is often explained using a latent generative process that produces the data, and the generative process is often hierarchical, involving latent concepts. Despite the significant work on understanding the learning of the latent structure and underlying concepts using theory and experiments, a framework that mathematically captures the definition of a concept and provides ways to operate on concepts is missing. In this work, we propose to characterize a simple primitive concept by the zero set of a collection of polynomials and use moment statistics of the data to uniquely represent the concepts; we show how this view can be used to obtain a signature of the concept. These signatures can be used to discover a common structure across the set of concepts and could recursively produce the signature of higher-level concepts from the signatures of lower-level concepts. To utilize such desired properties, we propose a method by keeping a dictionary of concepts and show that the proposed method can learn different types of hierarchical structures of the data.

View on arXiv PDF

Similar