This lesson is for subscribers
You've completed the free preview. Subscribe to unlock every lesson in every course.
Define the optimal state-value and action-value functions that maximize expected return across all policies.
You've completed the free preview. Subscribe to unlock every lesson in every course.