Machine Learning and Deep Learning
46. Reinforcement Learning: Fundamentals

Optimal Value Functions: V* and Q*

Define the optimal state-value and action-value functions that maximize expected return across all policies.
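
As a rough sketch of what these definitions usually look like in standard notation (the symbols \pi, G_t, V^{\pi} and Q^{\pi} are assumptions here, since the preview text does not fix a notation):

```latex
\begin{align*}
V^{\pi}(s) &= \mathbb{E}_{\pi}\left[ G_t \mid S_t = s \right], &
Q^{\pi}(s,a) &= \mathbb{E}_{\pi}\left[ G_t \mid S_t = s,\ A_t = a \right], \\
V^{*}(s) &= \max_{\pi} V^{\pi}(s), &
Q^{*}(s,a) &= \max_{\pi} Q^{\pi}(s,a), \\
V^{*}(s) &= \max_{a} Q^{*}(s,a).
\end{align*}
```

Here G_t stands for the return collected from time t onward under policy \pi. The last line ties the two functions together: the optimal value of a state is the value of the best action available in it.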
