Machine Learning and Deep Learning
Lesson 2317 of 3,538
50. Deep Reinforcement Learning: Advanced Policy Methods

Deterministic Policy Gradients

The deterministic policy gradient theorem: why deterministic policies can be more sample-efficient than stochastic ones.
