MNMay 12, 2022
Detailed Balanced Chemical Reaction Networks as Generalized Boltzmann MachinesWilliam Poole, Thomas Ouldridge, Manoj Gopalkrishnan et al.
Can a micron sized sack of interacting molecules understand, and adapt to a constantly-fluctuating environment? Cellular life provides an existence proof in the affirmative, but the principles that allow for life's existence are far from being proven. One challenge in engineering and understanding biochemical computation is the intrinsic noise due to chemical fluctuations. In this paper, we draw insights from machine learning theory, chemical reaction network theory, and statistical physics to show that the broad and biologically relevant class of detailed balanced chemical reaction networks is capable of representing and conditioning complex distributions. These results illustrate how a biochemical computer can use intrinsic chemical noise to perform complex computations. Furthermore, we use our explicit physical model to derive thermodynamic costs of inference.
MNNov 2, 2023
Autonomous Learning of Generative Models with Chemical Reaction Network EnsemblesWilliam Poole, Thomas E. Ouldridge, Manoj Gopalkrishnan
Can a micron sized sack of interacting molecules autonomously learn an internal model of a complex and fluctuating environment? We draw insights from control theory, machine learning theory, chemical reaction network theory, and statistical physics to develop a general architecture whereby a broad class of chemical systems can autonomously learn complex distributions. Our construction takes the form of a chemical implementation of machine learning's optimization workhorse: gradient descent on the relative entropy cost function. We show how this method can be applied to optimize any detailed balanced chemical reaction network and that the construction is capable of using hidden units to learn complex distributions. This result is then recast as a form of integral feedback control. Finally, due to our use of an explicit physical model of learning, we are able to derive thermodynamic costs and trade-offs associated to this process.
DSMar 26, 2013
A Projection Argument for Differential Inclusions, with Applications to Persistence of Mass-Action KineticsManoj Gopalkrishnan, Ezra Miller, Anne Shiu
Motivated by questions in mass-action kinetics, we introduce the notion of vertexical family of differential inclusions. Defined on open hypercubes, these families are characterized by particular good behavior under projection maps. The motivating examples are certain families of reaction networks -- including reversible, weakly reversible, endotactic, and strongly endotactic reaction networks -- that give rise to vertexical families of mass-action differential inclusions. We prove that vertexical families are amenable to structural induction. Consequently, a trajectory of a vertexical family approaches the boundary if and only if either the trajectory approaches a vertex of the hypercube, or a trajectory in a lower-dimensional member of the family approaches the boundary. With this technology, we make progress on the global attractor conjecture, a central open problem concerning mass-action kinetics systems. Additionally, we phrase mass-action kinetics as a functor on reaction networks with variable rates.
ETApr 24, 2018
A reaction network scheme which implements the EM algorithmMuppirala Viswa Virinchi, Abhishek Behera, Manoj Gopalkrishnan
A detailed algorithmic explanation is required for how a network of chemical reactions can generate the sophisticated behavior displayed by living cells. Though several previous works have shown that reaction networks are computationally universal and can in principle implement any algorithm, there is scope for constructions that map well onto biological reality, make efficient use of the computational potential of the native dynamics of reaction networks, and make contact with statistical mechanics. We describe a new reaction network scheme for solving a large class of statistical problems including the problem of how a cell would infer its environment from receptor-ligand bindings. Specifically we show how reaction networks can implement information projection, and consequently a generalized Expectation-Maximization algorithm, to solve maximum likelihood estimation problems in partially-observed exponential families on categorical data. Our scheme can be thought of as an algorithmic interpretation of E. T. Jaynes's vision of statistical mechanics as statistical inference.
SYApr 22, 2016
A Cost / Speed / Reliability Trade-off to ErasingManoj Gopalkrishnan
We present a KL-control treatment of the fundamental problem of erasing a bit. We introduce notions of "reliability" of information storage via a reliability timescale $τ_r$, and "speed" of erasing via an erasing timescale $τ_e$. Our problem formulation captures the tradeoff between speed, reliability, and the Kullback-Leibler (KL) cost required to erase a bit. We show that rapid erasing of a reliable bit costs at least $\log 2 - \log\left(1 - \operatorname{e}^{-\frac{τ_e}{τ_r}}\right) > \log 2$, which goes to $\frac{1}{2} \log\frac{2τ_r}{τ_e}$ when $τ_r>>τ_e$.
LGAug 27, 2021
Active Inference for Stochastic ControlAswin Paul, Noor Sajid, Manoj Gopalkrishnan et al.
Active inference has emerged as an alternative approach to control problems given its intuitive (probabilistic) formalism. However, despite its theoretical utility, computational implementations have largely been restricted to low-dimensional, deterministic settings. This paper highlights that this is a consequence of the inability to adequately model stochastic transition dynamics, particularly when an extensive policy (i.e., action trajectory) space must be evaluated during planning. Fortunately, recent advancements propose a modified planning algorithm for finite temporal horizons. We build upon this work to assess the utility of active inference for a stochastic control setting. For this, we simulate the classic windy grid-world task with additional complexities, namely: 1) environment stochasticity; 2) learning of transition dynamics; and 3) partial observability. Our results demonstrate the advantage of using active inference, compared to reinforcement learning, in both deterministic and stochastic settings.
ETJun 22, 2019
A reaction network scheme which implements inference and learning for Hidden Markov ModelsAbhinav Singh, Carsten Wiuf, Abhishek Behera et al.
With a view towards molecular communication systems and molecular multi-agent systems, we propose the Chemical Baum-Welch Algorithm, a novel reaction network scheme that learns parameters for Hidden Markov Models (HMMs). Each reaction in our scheme changes only one molecule of one species to one molecule of another. The reverse change is also accessible but via a different set of enzymes, in a design reminiscent of futile cycles in biochemical pathways. We show that every fixed point of the Baum-Welch algorithm for HMMs is a fixed point of our reaction network scheme, and every positive fixed point of our scheme is a fixed point of the Baum-Welch algorithm. We prove that the "Expectation" step and the "Maximization" step of our reaction network separately converge exponentially fast. We simulate mass-action kinetics for our network on an example sequence, and show that it learns the same parameters for the HMM as the Baum-Welch algorithm.
NEJun 10, 2015
A Scheme for Molecular Computation of Maximum Likelihood Estimators for Log-Linear ModelsManoj Gopalkrishnan
We propose a novel molecular computing scheme for statistical inference. We focus on the much-studied statistical inference problem of computing maximum likelihood estimators for log-linear models. Our scheme takes log-linear models to reaction systems, and the observed data to initial conditions, so that the corresponding equilibrium of each reaction system encodes the corresponding maximum likelihood estimator. The main idea is to exploit the coincidence between thermodynamic entropy and statistical entropy. We map a Maximum Entropy characterization of the maximum likelihood estimator onto a Maximum Entropy characterization of the equilibrium concentrations for the reaction system. This allows for an efficient encoding of the problem, and reveals that reaction networks are superbly suited to statistical inference tasks. Such a scheme may also provide a template to understanding how in vivo biochemical signaling pathways integrate extensive information about their environment and history.
CRMay 23, 2013
A coercion-resistant protocol for conducting elections by telephoneManoj Gopalkrishnan
We present a protocol that allows voters to phone in their votes. Our protocol makes it expensive for a candidate and a voter to cooperate to prove to the candidate who the voter voted for. When the electoral pool is large enough, the cost to the candidate of manipulating sufficiently many votes to have an influence on the election results becomes impossibly expensive. Hence, the protocol provides candidates no incentive to attempt inducement or coercion of voters, resulting in free and fair elections with the promise of cost savings and higher voter turnout over traditional elections. One major inadequacy with our suggested protocol is that we assume the existence of a trusted election authority to count the votes.