Defining scalar rewards R(s,a,s') that guide agent behavior toward goals.
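
As an illustration of what a scalar reward R(s,a,s') can look like in practice, here is a minimal sketch for a small grid world. The grid, the goal cell, the action names, and the numeric values (`GOAL`, `STEP_COST`, `GOAL_REWARD`) are all assumptions made up for this example and are not taken from the lesson.

```python
from typing import Tuple

# Hypothetical setup: a state is a (row, col) cell on a small grid.
State = Tuple[int, int]

GOAL: State = (2, 2)   # assumed goal cell
STEP_COST = -0.1       # assumed small penalty for every non-goal transition
GOAL_REWARD = 1.0      # assumed reward for entering the goal cell


def reward(s: State, a: str, s_next: State) -> float:
    """Return the scalar reward R(s, a, s') for a single transition.

    The action `a` is part of the signature to match R(s, a, s'),
    but this particular sketch does not use it.
    """
    if s_next == GOAL and s != GOAL:
        return GOAL_REWARD  # the transition enters the goal
    return STEP_COST        # any other transition costs a little


if __name__ == "__main__":
    print(reward((2, 1), "right", (2, 2)))  # transition into the goal
    print(reward((0, 0), "right", (0, 1)))  # ordinary step
```

The per-step penalty is one possible design choice: it makes shorter paths to the goal accumulate more total reward than longer ones, which is one way a reward signal can steer behavior toward a goal.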