Pricing: exploration-exploitation tradeoff

This paper offers an elegant solution to the exploration-exploitation tradeoff in a "cold-start" pricing problem.

Exploration & Exploitation

  • need experimentation to learn demand curve before setting the optimal price (cold-start)
  • finding optimal price earlier ensures a higher profit

Solution: fine-tuned UCB algorithms

  • tuning the exploration bonus term: accounts for price $p_k$ and uncertainty $2\hat{\delta}$
  • "shutoff" rule: do not explore dominated options
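The two modifications above can be sketched as a small simulation. This is a minimal illustration, not the paper's algorithm: the function name `ucb_pricing`, the simulated purchase probabilities `buy_prob`, and the standard UCB1 width sqrt(2 ln t / n_k) are all assumptions made here, and the shutoff test uses simple confidence bounds on profit as a stand-in for the paper's own dominance criterion.

```python
import math
import random


def ucb_pricing(prices, buy_prob, horizon, seed=0):
    """Price-scaled UCB with a simple shutoff rule (illustrative sketch).

    prices   : candidate prices p_k (one bandit arm each)
    buy_prob : hypothetical true purchase probability at each price,
               used only to simulate customers
    horizon  : number of customers (rounds)
    """
    rng = random.Random(seed)
    k = len(prices)
    pulls = [0] * k   # times each price was offered
    sales = [0] * k   # purchases observed at each price
    revenue = 0.0

    for t in range(1, horizon + 1):
        if t <= k:
            # Offer every price once before computing any index.
            arm = t - 1
        else:
            mean = [sales[i] / pulls[i] for i in range(k)]
            # Assumed UCB1-style uncertainty width on the purchase rate.
            width = [math.sqrt(2.0 * math.log(t) / pulls[i]) for i in range(k)]
            # Profit bounds; the purchase rate is clipped to [0, 1].
            lower = [prices[i] * max(0.0, mean[i] - width[i]) for i in range(k)]
            upper = [prices[i] * min(1.0, mean[i] + width[i]) for i in range(k)]
            # Shutoff: skip prices whose best case is below another
            # price's worst case (dominated options).
            best_lower = max(lower)
            active = [i for i in range(k) if upper[i] >= best_lower]
            # Exploration bonus scaled by price: p_k * (mean + width).
            arm = max(active, key=lambda i: prices[i] * (mean[i] + width[i]))

        sold = rng.random() < buy_prob[arm]
        pulls[arm] += 1
        if sold:
            sales[arm] += 1
            revenue += prices[arm]

    return pulls, revenue


if __name__ == "__main__":
    pulls, revenue = ucb_pricing(
        prices=[1.0, 2.0, 3.0, 4.0],
        buy_prob=[0.9, 0.7, 0.3, 0.1],
        horizon=5000,
    )
    print(pulls, revenue)
```

Scaling the bonus by the price keeps the index in profit units, so low prices get a proportionally smaller bonus. In this sketch the active set is recomputed every round rather than shut off permanently, which is a conservative design choice made here for simplicity.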

Contributions

  1. a novel combination of economic theory and machine learning to solve the pricing problem
  2. introduces a distribution-free theory of demand that improves existing algorithms both theoretically and empirically
Short Summary
Model setup
Modified Algorithms
Some Thoughts
Pricing with Federated Learning
Xuhang Fan, Duke University
Dynamic Online Pricing Using MAB Experiments
2023/01/01