This lesson is for subscribers
You've completed the free preview. Subscribe to unlock every lesson in every course.
Express Q*(s,a) using the optimal value of the best action in successor states.
You've completed the free preview. Subscribe to unlock every lesson in every course.