Entangling Machine Learning with Quantum Tensor Networks
It addresses language modeling for AI researchers, but is incremental as it builds on prior work.
This paper tackles the problem of modeling language using tensor networks by abstracting it to Motzkin spin chains, achieving near-perfect classification and stable performance with fewer training examples.
This paper examines the use of tensor networks, which can efficiently represent high-dimensional quantum states, in language modeling. It is a distillation and continuation of the work done in (van der Poel, 2023). To do so, we will abstract the problem down to modeling Motzkin spin chains, which exhibit long-range correlations reminiscent of those found in language. The Matrix Product State (MPS), also known as the tensor train, has a bond dimension which scales as the length of the sequence it models. To combat this, we use the factored core MPS, whose bond dimension scales sub-linearly. We find that the tensor models reach near perfect classifying ability, and maintain a stable level of performance as the number of valid training examples is decreased.