This lesson is for subscribers
You've completed the free preview. Subscribe to unlock every lesson in every course.
Derive the optimality condition where V*(s) equals the maximum over actions of expected returns.
You've completed the free preview. Subscribe to unlock every lesson in every course.