This lesson is for subscribers
You've completed the free preview. Subscribe to unlock every lesson in every course.
Show how optimal state values and action values relate: V*(s) = max_a Q*(s,a).
You've completed the free preview. Subscribe to unlock every lesson in every course.