←
^
→
Reinforcement Learning
Example
$Q(s,a)$ values are associated with $s,a$ pairs.
$V(s)$ values decrease with distance from goal.
José M. Vidal
.
8 of 22