Course contentsShow
Machine Learning and Deep Learning
Lesson 2272 of 3,53849. Deep Reinforcement Learning: Policy-BasedPro lesson

REINFORCE Convergence Properties

Why REINFORCE is guaranteed to converge to local optima and how learning rate affects stability.

This lesson is for subscribers

You've completed the free preview. Subscribe to unlock every lesson in every course.