When asked for a bid, the seller s will provide one whose price is greater than or equal to the cost for producing it (i.e. ). From these prices, he will chose the one with the highest expected profit:
The function returns the profit s expects to get if he offers good g at a price p. It is also learned using reinforcement learning, as follows:
where