When asked for a bid, the seller s will provide one whose price is
greater than or equal to the cost for
producing it (i.e.
). From these prices, he
will chose the one with the highest expected profit:
The function returns the profit s expects to get if
he offers good g at a price p. It is also learned using
reinforcement learning, as follows:
where