This lesson is for subscribers
You've completed the free preview. Subscribe to unlock every lesson in every course.
Representing policies as neural networks with continuous parameters that map states to action distributions.
You've completed the free preview. Subscribe to unlock every lesson in every course.