Course contentsShow
Machine Learning and Deep Learning
Lesson 2244 of 3,53848. Deep Reinforcement Learning: Value-BasedPro lesson

Target Network Updates

Implement periodic hard updates or soft updates (Polyak averaging) to synchronize the target network with the Q-network.

This lesson is for subscribers

You've completed the free preview. Subscribe to unlock every lesson in every course.