This lesson is for subscribers
You've completed the free preview. Subscribe to unlock every lesson in every course.
How to represent stochastic policies using neural networks that output action probability distributions.
You've completed the free preview. Subscribe to unlock every lesson in every course.