Michael King

CV
h-index43
5papers
88citations
Novelty37%
AI Score39

5 Papers

IVJul 20, 2022
Learning to estimate a surrogate respiratory signal from cardiac motion by signal-to-signal translation

Akshay Iyer, Clifford Lindsay, Hendrik Pretorius et al.

In this work, we develop a neural network-based method to convert a noisy motion signal generated from segmenting rebinned list-mode cardiac SPECT images, to that of a high-quality surrogate signal, such as those seen from external motion tracking systems (EMTs). This synthetic surrogate will be used as input to our pre-existing motion correction technique developed for EMT surrogate signals. In our method, we test two families of neural networks to translate noisy internal motion to external surrogate: 1) fully connected networks and 2) convolutional neural networks. Our dataset consists of cardiac perfusion SPECT acquisitions for which cardiac motion was estimated (input: center-of-count-mass - COM signals) in conjunction with a respiratory surrogate motion signal acquired using a commercial Vicon Motion Tracking System (GT: EMT signals). We obtained an average R-score of 0.76 between the predicted surrogate and the EMT signal. Our goal is to lay a foundation to guide the optimization of neural networks for respiratory motion correction from SPECT without the need for an EMT.

CVDec 19, 2024Code
Scaling 4D Representations

João Carreira, Dilara Gokay, Michael King et al.

Scaling has not yet been convincingly demonstrated for pure self-supervised learning from video. However, prior work has focused evaluations on semantic-related tasks $\unicode{x2013}$ action classification, ImageNet classification, etc. In this paper we focus on evaluating self-supervised learning on non-semantic vision tasks that are more spatial (3D) and temporal (+1D = 4D), such as camera pose estimation, point and object tracking, and depth estimation. We show that by learning from very large video datasets, masked auto-encoding (MAE) with transformer video models actually scales, consistently improving performance on these 4D tasks, as model size increases from 20M all the way to the largest by far reported self-supervised video model $\unicode{x2013}$ 22B parameters. Rigorous apples-to-apples comparison with many recent image and video models demonstrates the benefits of scaling 4D representations. Pretrained models are available at https://github.com/google-deepmind/representations4d .

OCApr 5
Gramians for a New Class of Nonlinear Control Systems Using Koopman and a Novel Generalized SVD

Brian Brown, Michael King

Certified model reduction for high-dimensional nonlinear control systems remains challenging: unlike balanced truncation for LTI systems, most nonlinear reduction methods either lack computable worst-case error bounds or rely on intractable PDEs. Data-driven Koopman/DMDc surrogates improve tractability, but standard \emph{input lifting} can distort the physical input-energy metric, so $H_\infty$ and Hankel-based bounds computed on the lifted model may be valid only in a lifted-input norm and need not certify the original system. We address this metric mismatch by a Generalized Singular Value Decomposition (GSVD)-based construction that represents general (including non-affine) input nonlinearities in an LTI-like lifted form with a \emph{pointwise norm-preserving} input map $v(x,u)$ satisfying $\|v(x,u)\|_2=\|u\|_2$ and constant matrices $A,B$. This preserves strict causality (constant $B$, no input-history augmentation) and yields computable Hankel-singular-value-based $H_\infty$ error certificates in the physical input norm for reduced-order surrogates. We illustrate the method on a 25-dimensional Hodgkin--Huxley network with saturating optogenetic actuation, reducing to a single dominant mode while retaining certified error bounds.

LGFeb 4, 2021
Alchemy: A benchmark and analysis toolkit for meta-reinforcement learning agents

Jane X. Wang, Michael King, Nicolas Porcel et al.

There has been rapidly growing interest in meta-learning as a method for increasing the flexibility and sample efficiency of reinforcement learning. One problem in this area of research, however, has been a scarcity of adequate benchmark tasks. In general, the structure underlying past benchmarks has either been too simple to be inherently interesting, or too ill-defined to support principled analysis. In the present work, we introduce a new benchmark for meta-RL research, emphasizing transparency and potential for in-depth analysis as well as structural richness. Alchemy is a 3D video game, implemented in Unity, which involves a latent causal structure that is resampled procedurally from episode to episode, affording structure learning, online inference, hypothesis testing and action sequencing based on abstract domain knowledge. We evaluate a pair of powerful RL agents on Alchemy and present an in-depth analysis of one of these agents. Results clearly indicate a frank and specific failure of meta-learning, providing validation for Alchemy as a challenging benchmark for meta-RL. Concurrent with this report, we are releasing Alchemy as public resource, together with a suite of analysis tools and sample agent trajectories.

CVJun 6, 2020
The Criminality From Face Illusion

Kevin W. Bowyer, Michael King, Walter Scheirer et al.

The automatic analysis of face images can generate predictions about a person's gender, age, race, facial expression, body mass index, and various other indices and conditions. A few recent publications have claimed success in analyzing an image of a person's face in order to predict the person's status as Criminal / Non-Criminal. Predicting criminality from face may initially seem similar to other facial analytics, but we argue that attempts to create a criminality-from-face algorithm are necessarily doomed to fail, that apparently promising experimental results in recent publications are an illusion resulting from inadequate experimental design, and that there is potentially a large social cost to belief in the criminality from face illusion.