Samuel A. Burden

h-index18

9papers

53citations

Novelty39%

AI Score26

Ranked #161,044 of 194,257 authors (top 83%)#5,135 in RO (top 76%)

9 Papers

1.2SYJan 10, 2022

On infinitesimal contraction analysis for hybrid systems

Samuel A. Burden, Thomas Libby, Samuel D. Coogan

Infinitesimal contraction analysis, wherein global asymptotic convergence results are obtained from local dynamical properties, has proven to be a powerful tool for applications in biological, mechanical, and transportation systems. The technique has primarily been developed for systems governed by a single, possibly nonsmooth, differential or difference equation. We generalize infinitesimal contraction analysis to hybrid systems governed by interacting differential and difference equations. Importantly, we leverage an intrinsic distance function to derive the first contraction results for hybrid systems without restrictions on mode sequence or dwell time. Our theoretical results are illustrated in several examples and applications.

5.8AIAug 26, 2024

Effect of Adaptation Rate and Cost Display in a Human-AI Interaction Game

Jason T. Isa, Bohan Wu, Qirui Wang et al.

As interactions between humans and AI become more prevalent, it is critical to have better predictors of human behavior in these interactions. We investigated how changes in the AI's adaptive algorithm impact behavior predictions in two-player continuous games. In our experiments, the AI adapted its actions using a gradient descent algorithm under different adaptation rates while human participants were provided cost feedback. The cost feedback was provided by one of two types of visual displays: (a) cost at the current joint action vector, or (b) cost in a local neighborhood of the current joint action vector. Our results demonstrate that AI adaptation rate can significantly affect human behavior, having the ability to shift the outcome between two game theoretic equilibrium. We observed that slow adaptation rates shift the outcome towards the Nash equilibrium, while fast rates shift the outcome towards the human-led Stackelberg equilibrium. The addition of localized cost information had the effect of shifting outcomes towards Nash, compared to the outcomes from cost information at only the current joint action vector. Future work will investigate other effects that influence the convergence of gradient descent games.

1.2GTJan 15, 2025

A Learning Algorithm That Attains the Human Optimum in a Repeated Human-Machine Interaction Game

Jason T. Isa, Lillian J. Ratliff, Samuel A. Burden

When humans interact with learning-based control systems, a common goal is to minimize a cost function known only to the human. For instance, an exoskeleton may adapt its assistance in an effort to minimize the human's metabolic cost-of-transport. Conventional approaches to synthesizing the learning algorithm solve an inverse problem to infer the human's cost. However, these problems can be ill-posed, hard to solve, or sensitive to problem data. Here we show a game-theoretic learning algorithm that works solely by observing human actions to find the cost minimum, avoiding the need to solve an inverse problem. We evaluate the performance of our algorithm in an extensive set of human subjects experiments, demonstrating consistent convergence to the minimum of a prescribed human cost function in scalar and multidimensional instantiations of the game. We conclude by outlining future directions for theoretical and empirical extensions of our results.

6.7AIMay 1, 2023

Human adaptation to adaptive machines converges to game-theoretic equilibria

Benjamin J. Chasnov, Lillian J. Ratliff, Samuel A. Burden

Adaptive machines have the potential to assist or interfere with human behavior in a range of contexts, from cognitive decision-making to physical device assistance. Therefore it is critical to understand how machine learning algorithms can influence human actions, particularly in situations where machine goals are misaligned with those of people. Since humans continually adapt to their environment using a combination of explicit and implicit strategies, when the environment contains an adaptive machine, the human and machine play a game. Game theory is an established framework for modeling interactions between two or more decision-makers that has been applied extensively in economic markets and machine algorithms. However, existing approaches make assumptions about, rather than empirically test, how adaptation by individual humans is affected by interaction with an adaptive machine. Here we tested learning algorithms for machines playing general-sum games with human subjects. Our algorithms enable the machine to select the outcome of the co-adaptive interaction from a constellation of game-theoretic equilibria in action and policy spaces. Importantly, the machine learning algorithms work directly from observations of human actions without solving an inverse problem to estimate the human's utility function as in prior work. Surprisingly, one algorithm can steer the human-machine interaction to the machine's optimum, effectively controlling the human's actions even while the human responds optimally to their perceived cost landscape. Our results show that game theory can be used to predict and design outcomes of co-adaptive interactions between intelligent humans and machines.

11.3OCMay 30, 2019

Convergence Analysis of Gradient-Based Learning with Non-Uniform Learning Rates in Non-Cooperative Multi-Agent Settings

Benjamin Chasnov, Lillian J. Ratliff, Eric Mazumdar et al.

Considering a class of gradient-based multi-agent learning algorithms in non-cooperative settings, we provide local convergence guarantees to a neighborhood of a stable local Nash equilibrium. In particular, we consider continuous games where agents learn in (i) deterministic settings with oracle access to their gradient and (ii) stochastic settings with an unbiased estimator of their gradient. Utilizing the minimum and maximum singular values of the game Jacobian, we provide finite-time convergence guarantees in the deterministic case. On the other hand, in the stochastic case, we provide concentration bounds guaranteeing that with high probability agents will converge to a neighborhood of a stable local Nash equilibrium in finite time. Different than other works in this vein, we also study the effects of non-uniform learning rates on the learning dynamics and convergence rates. We find that much like preconditioning in optimization, non-uniform learning rates cause a distortion in the vector field which can, in turn, change the rate of convergence and the shape of the region of attraction. The analysis is supported by numerical examples that illustrate different aspects of the theory. We conclude with discussion of the results and open questions.

1.7ROOct 18, 2017

Nonsmooth optimal value and policy functions in mechanical systems subject to unilateral constraints

Bora S. Banjanin, Samuel A. Burden

State-of-the-art approaches to optimal control use smooth approximations of value and policy functions and gradient-based algorithms for improving approximator parameters. Unfortunately, we show that value and policy functions that arise in optimal control of mechanical systems subject to unilateral constraints -- i.e. the contact-rich dynamics of robot locomotion and manipulation -- are generally nonsmooth due to the underlying dynamics exhibiting discontinuous or piecewise-differentiable trajectory outcomes. Simple mechanical systems are used to illustrate this result and the implications for optimal control of contact-rich robot dynamics.

8.7OCOct 17, 2016

Piecewise-differentiable trajectory outcomes in mechanical systems subject to unilateral constraints

Andrew M. Pace, Samuel A. Burden

We provide conditions under which trajectory outcomes in mechanical systems subject to unilateral constraints depend piecewise-differentiably on initial conditions, even as the sequence of constraint activations and deactivations varies. This builds on prior work that provided conditions ensuring existence, uniqueness, and continuity of trajectory outcomes, and extends previous differentiability results that applied only to fixed constraint (de)activation sequences. We discuss extensions of our result and implications for assessing stability and controllability.

6.7ROSep 13, 2016

Decoupled limbs yield differentiable trajectory outcomes through intermittent contact in locomotion and manipulation

Andrew Pace, Samuel A. Burden

When limbs are decoupled, we find that trajectory outcomes in mechanical systems subject to unilateral constraints vary differentiably with respect to initial conditions, even as the contact mode sequence varies.

2.1ROJul 13, 2016

A Hybrid Dynamical Extension of Averaging

Avik De, Samuel A. Burden, Daniel E. Koditschek

We extend a smooth dynamical systems averaging technique to a class of hybrid systems with a limit cycle that is particularly relevant to the synthesis of stable legged gaits. After introducing a definition of hybrid averageability sufficient to recover the classical result, we provide a simple illustration of its applicability to legged locomotion and conclude with some rather more speculative remarks concerning the prospects for further generalization of these ideas.