Course contentsShow
Machine Learning and Deep Learning
Lesson 2321 of 3,53850. Deep Reinforcement Learning: Advanced Policy MethodsPro lesson

Twin Delayed DDPG (TD3)

Three key improvements over DDPG: clipped double Q-learning, delayed policy updates, and target policy smoothing.

This lesson is for subscribers

You've completed the free preview. Subscribe to unlock every lesson in every course.