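When asked for a bid, the seller s will provide one whose price is
greater than or equal to the cost of producing it (i.e. $p \geq c_s^g$,
with $c_s^g$ denoting the cost to s of producing good g). From these
prices, he will choose the one with the highest expected profit:

$$p^* = \arg\max_{p \,\geq\, c_s^g} h_s^g(p)$$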
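The selection rule described above could be sketched in Python roughly as follows. This is a minimal illustration, not the authors' implementation: the function name `choose_bid_price`, the discrete set of candidate prices, the dictionary used to store expected profits, and the default of 0 for prices with no estimate yet are all assumptions made for the example.

```python
def choose_bid_price(candidate_prices, cost, expected_profit):
    """Return the candidate price >= cost with the highest expected profit.

    expected_profit is a hypothetical lookup table mapping a price to the
    profit the seller currently expects at that price; prices without an
    estimate are assumed to be worth 0.
    """
    # Keep only the prices that cover the production cost.
    feasible = [p for p in candidate_prices if p >= cost]
    if not feasible:
        raise ValueError("no candidate price covers the production cost")
    # Pick the feasible price with the largest expected profit.
    return max(feasible, key=lambda p: expected_profit.get(p, 0.0))


# Made-up numbers, for illustration only.
expected_profit = {3: 1.5, 4: 0.4, 5: 1.2, 6: 0.9}
best = choose_bid_price(range(3, 7), cost=4, expected_profit=expected_profit)
```

In this example the price 3 has the largest estimate but is excluded because it lies below the cost, so the seller would bid the best of the remaining prices.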
The function $h_s^g(p)$ returns the profit s expects to get if he
offers good g at a price p. It is also learned using
reinforcement learning, as follows:
where