Course contentsShow
Machine Learning and Deep Learning
Lesson 2290 of 3,53850. Deep Reinforcement Learning: Advanced Policy MethodsPro lesson

The Policy Update Problem

Why unrestricted policy updates can cause performance collapse and the need for constrained optimization.

This lesson is for subscribers

You've completed the free preview. Subscribe to unlock every lesson in every course.