Course contentsShow
Machine Learning and Deep Learning
Lesson 2152 of 3,53846. Reinforcement Learning: FundamentalsPro lesson

The Bellman Optimality Equation for V*

Derive the optimality condition where V*(s) equals the maximum over actions of expected returns.

This lesson is for subscribers

You've completed the free preview. Subscribe to unlock every lesson in every course.