SDSTAT-MECHLGASNov 26, 2019

SchrödingeRNN: Generative Modeling of Raw Audio as a Continuously Observed Quantum State

arXiv:1911.11879v13 citations
Originality Synthesis-oriented
AI Analysis

This work addresses generative modeling of raw audio for audio processing applications, but it is incremental as it applies an existing quantum method to a new domain.

The authors tackled generative modeling of raw audio by introducing SchrödingeRNN, a quantum-inspired model based on stochastic Schrödinger equations, achieving results on synthetic stationary and non-stationary signals, marking the first use of continuous Matrix Product States in machine learning.

We introduce SchrödingeRNN, a quantum inspired generative model for raw audio. Audio data is wave-like and is sampled from a continuous signal. Although generative modelling of raw audio has made great strides lately, relational inductive biases relevant to these two characteristics are mostly absent from models explored to date. Quantum Mechanics is a natural source of probabilistic models of wave behaviour. Our model takes the form of a stochastic Schrödinger equation describing the continuous time measurement of a quantum system, and is equivalent to the continuous Matrix Product State (cMPS) representation of wavefunctions in one dimensional many-body systems. This constitutes a deep autoregressive architecture in which the systems state is a latent representation of the past observations. We test our model on synthetic data sets of stationary and non-stationary signals. This is the first time cMPS are used in machine learning.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes