Using Gaussian policies and reparameterization for REINFORCE in continuous control tasks.