Machine Learning and Deep Learning
46. Reinforcement Learning: Fundamentals

Value Functions: State Value V

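The state-value function V^π(s) is the expected return when the agent starts in state s and follows policy π from then on: V^π(s) = E_π[G_t | S_t = s]. Here G_t = R_{t+1} + γ R_{t+2} + γ² R_{t+3} + ... is the discounted return from time step t, and γ ∈ [0, 1] is the discount factor.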
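Because V^π(s) is an expectation over returns, one way to build intuition is to estimate it by averaging sampled returns (a Monte Carlo estimate). The sketch below does this on a tiny made-up process. The transition table, the rewards, the discount factor of 0.9, and the function names are all illustrative assumptions and do not come from the lesson. The return is taken to be the usual discounted sum of rewards.

```python
import random

GAMMA = 0.9      # assumed discount factor
TERMINAL = 2     # hypothetical terminal state

# Hypothetical dynamics once a fixed policy pi is followed:
# state -> list of (probability, next_state, reward)
TRANSITIONS = {
    0: [(0.5, 0, 0.0), (0.5, 1, 1.0)],
    1: [(1.0, TERMINAL, 2.0)],
}


def sample_return(state, rng):
    """Roll out one episode from `state` and return its discounted return G_t."""
    g, discount = 0.0, 1.0
    while state != TERMINAL:
        outcomes = TRANSITIONS[state]
        weights = [p for p, _, _ in outcomes]
        _, state, reward = rng.choices(outcomes, weights=weights)[0]
        g += discount * reward
        discount *= GAMMA
    return g


def mc_state_value(state, episodes=20000, seed=0):
    """Estimate V^pi(state) as the average of sampled returns."""
    rng = random.Random(seed)
    total = sum(sample_return(state, rng) for _ in range(episodes))
    return total / episodes


def exact_state_values():
    """Closed-form values for this toy process, used as a sanity check."""
    v1 = 2.0
    # V(0) = 0.5 * (0 + GAMMA * V(0)) + 0.5 * (1 + GAMMA * V(1)), solved for V(0)
    v0 = 0.5 * (1.0 + GAMMA * v1) / (1.0 - 0.5 * GAMMA)
    return {0: v0, 1: v1}


if __name__ == "__main__":
    exact = exact_state_values()
    for s in (0, 1):
        print(s, round(mc_state_value(s), 3), round(exact[s], 3))
```

With enough episodes the sampled average should approach the closed-form value, which is what the conditional expectation in the definition says.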
