GTAIAug 29, 2017

Learning to Price with Reference Effects

arXiv:1708.09020v12 citations
Originality Synthesis-oriented
AI Analysis

This addresses pricing optimization for firms facing consumer behavioral patterns, though it is incremental as it applies existing methods to a specific economic context.

The paper tackles the problem of a monopolist learning to price products when consumers exhibit reference effects, by framing it as a reinforcement learning problem using Thompson sampling and establishing a regret bound for performance guarantees.

As a firm varies the price of a product, consumers exhibit reference effects, making purchase decisions based not only on the prevailing price but also the product's price history. We consider the problem of learning such behavioral patterns as a monopolist releases, markets, and prices products. This context calls for pricing decisions that intelligently trade off between maximizing revenue generated by a current product and probing to gain information for future benefit. Due to dependence on price history, realized demand can reflect delayed consequences of earlier pricing decisions. As such, inference entails attribution of outcomes to prior decisions and effective exploration requires planning price sequences that yield informative future outcomes. Despite the considerable complexity of this problem, we offer a tractable systematic approach. In particular, we frame the problem as one of reinforcement learning and leverage Thompson sampling. We also establish a regret bound that provides graceful guarantees on how performance improves as data is gathered and how this depends on the complexity of the demand model. We illustrate merits of the approach through simulations.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes