Reinforcement Learning Problem
- The agent is in a state $s_t$, takes action $a_t$, for
which it receives a reward $r_t$, and ends up at state $s_{t+1}$.
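- We define the problem:
$S$: finite set of states
$A$: finite set of actions
$s_{t+1} = \delta(s_t, a_t)$: transition function
$r_t = r(s_t, a_t)$: reward function
- We also assume that the transition function $\delta$ and the reward function $r$ depend
only on the current state and action.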
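As a rough illustration of the definitions above, the following sketch encodes a tiny deterministic problem in Python. The three-state corridor, the action names, the reward of 100, and the helper `run_episode` are all hypothetical choices made for this example; they do not come from the slides. The only property taken from the text is that `delta` and `reward` look at nothing but the current state and action.

```python
from typing import Callable, List, Tuple

State = str
Action = str

# Hypothetical toy problem: a three-cell corridor with an absorbing goal.
STATES: List[State] = ["left", "middle", "goal"]   # S, finite set of states
ACTIONS: List[Action] = ["west", "east"]           # A, finite set of actions


def delta(s: State, a: Action) -> State:
    """Transition function: next state from current state and action only."""
    if s == "goal":
        return s  # assumed absorbing: the goal state never changes
    i = STATES.index(s)
    i = min(i + 1, len(STATES) - 1) if a == "east" else max(i - 1, 0)
    return STATES[i]


def reward(s: State, a: Action) -> float:
    """Reward function: depends only on the current state and action."""
    return 100.0 if s != "goal" and delta(s, a) == "goal" else 0.0


def run_episode(
    s0: State, policy: Callable[[State], Action], steps: int
) -> List[Tuple[State, Action, float, State]]:
    """Record (s_t, a_t, r_t, s_{t+1}) for a fixed number of steps."""
    trajectory = []
    s = s0
    for _ in range(steps):
        a = policy(s)
        r = reward(s, a)
        s_next = delta(s, a)
        trajectory.append((s, a, r, s_next))
        s = s_next
    return trajectory


if __name__ == "__main__":
    # A policy that always moves east, starting from the leftmost cell.
    for step in run_episode("left", lambda s: "east", 3):
        print(step)
```

Because `delta` and `reward` take only `(s, a)` as arguments, the history of earlier states cannot influence the outcome, which is the assumption stated in the last bullet.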
José M. Vidal