Model Setup: MAB Pricing
Profits: The firm can determine the price from price set p∈{p1,p2,...pk} and face with demand D(p), thus the profit is π(p)=pD(p).
Price experimentation: Suppose by time t, the firm has charged pk a total of nkt times. Let πk,1,πk,2,…,πk,nkt be realizations of profit per consumer from every time that price pk has been charged.
- We assume that these are drawn from an unknown probability distribution with a mean at the true profit π(pk).
Pricing problem:
pt=Ψ({pτ,πτ∣τ=1,…,t−1})
Test