OCFeb 21, 2017
Iterative bidding in electricity markets: rationality and robustnessAshish Cherukuri, Jorge Cortes
This paper studies an electricity market consisting of an independent system operator (ISO) and a group of generators. The goal is to solve the DC optimal power flow (DC-OPF) problem: have the generators collectively meet the power demand while minimizing the aggregate generation cost and respecting line flow limits in the network. The ISO by itself cannot solve the DC-OPF problem as generators are strategic and do not share their cost functions. Instead, each generator submits to the ISO a bid, consisting of the price per unit of electricity at which it is willing to provide power. Based on the bids, the ISO decides how much production to allocate to each generator to minimize the total payment while meeting the load and satisfying the line limits. We provide a provably correct, decentralized iterative scheme, termed BID ADJUSTMENT ALGORITHM, for the resulting Bertrand competition game. Regarding convergence, we show that the algorithm takes the generators' bids to any desired neighborhood of the efficient Nash equilibrium at a linear convergence rate. As a consequence, the optimal production of the generators converges to the optimizer of the DC-OPF problem. Regarding robustness, we show that the algorithm is robust to affine perturbations in the bid adjustment scheme and that there is no incentive for any individual generator to deviate from the algorithm by using an alternative bid update scheme. We also establish the algorithm robustness to collusion, i.e., we show that, as long as each bus with generation has a generator following the strategy, there is no incentive for any group of generators to share information with the intent of tricking the system to obtain a higher payoff. Simulations illustrate our results.
OCSep 6, 2022
Cross apprenticeship learning framework: Properties and solution approachesAshwin Aravind, Debasish Chatterjee, Ashish Cherukuri
Apprenticeship learning is a framework in which an agent learns a policy to perform a given task in an environment using example trajectories provided by an expert. In the real world, one might have access to expert trajectories in different environments where the system dynamics is different while the learning task is the same. For such scenarios, two types of learning objectives can be defined. One where the learned policy performs very well in one specific environment and another when it performs well across all environments. To balance these two objectives in a principled way, our work presents the cross apprenticeship learning (CAL) framework. This consists of an optimization problem where an optimal policy for each environment is sought while ensuring that all policies remain close to each other. This nearness is facilitated by one tuning parameter in the optimization problem. We derive properties of the optimizers of the problem as the tuning parameter varies. Since the problem is nonconvex, we provide a convex outer approximation. Finally, we demonstrate the attributes of our framework in the context of a navigation task in a windy gridworld environment.
OCOct 10, 2018
Data-Driven Chance Constrained Optimization under Wasserstein Ambiguity SetsAshish R. Hota, Ashish Cherukuri, John Lygeros
We present a data-driven approach for distributionally robust chance constrained optimization problems (DRCCPs). We consider the case where the decision maker has access to a finite number of samples or realizations of the uncertainty. The chance constraint is then required to hold for all distributions that are close to the empirical distribution constructed from the samples (where the distance between two distributions is defined via the Wasserstein metric). We first reformulate DRCCPs under data-driven Wasserstein ambiguity sets and a general class of constraint functions. When the feasibility set of the chance constraint program is replaced by its convex inner approximation, we present a convex reformulation of the program and show its tractability when the constraint function is affine in both the decision variable and the uncertainty. For constraint functions concave in the uncertainty, we show that a cutting-surface algorithm converges to an approximate solution of the convex inner approximation of DRCCPs. Finally, for constraint functions convex in the uncertainty, we compare the feasibility set with other sample-based approaches for chance constrained programs.