A buyer b will start by requesting bids for a good g. She will then accept all bids for good g and will pick the seller:
The function returns the value the buyer expects to get if she buys good g at a price of p. It is learned using a simple form of reinforcement learning, namely:
where is the learning rate, p is the price b paid for the good, and q is the quality she ascribed to it. The learning rate is initially set to 1 and, like , is decreased until it reaches some fixed minimum value .