Implement periodic hard updates or soft updates (Polyak averaging) to synchronize the target network with the Q-network.
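The two synchronization strategies named above can be sketched as follows. This is a minimal, dependency-free illustration, not the lesson's own implementation: the `Params` type (a dict of weight lists standing in for network parameters), the function names, and the default values for `period` and `tau` are all assumptions made for this example. In a deep learning framework the same arithmetic would be applied to the parameter tensors of each network.

```python
from typing import Dict, List

# Hypothetical stand-in for network weights: parameter name -> flat list of floats.
Params = Dict[str, List[float]]


def hard_update(target: Params, online: Params) -> None:
    """Hard update: copy every Q-network weight into the target network."""
    for name, values in online.items():
        target[name] = list(values)  # copy, so the two networks never share storage


def soft_update(target: Params, online: Params, tau: float) -> None:
    """Soft update (Polyak averaging): target <- tau * online + (1 - tau) * target."""
    for name, values in online.items():
        target[name] = [
            tau * w + (1.0 - tau) * t for w, t in zip(values, target[name])
        ]


def sync_target(
    step: int,
    target: Params,
    online: Params,
    mode: str = "hard",
    period: int = 1000,   # assumed value; tune per task
    tau: float = 0.005,   # assumed value; tune per task
) -> None:
    """Call once per training step to keep the target network in sync."""
    if mode == "hard":
        # Periodic hard update: full copy every `period` steps, frozen in between.
        if step % period == 0:
            hard_update(target, online)
    elif mode == "soft":
        # Soft update: small blend toward the Q-network on every step.
        soft_update(target, online, tau)
    else:
        raise ValueError(f"unknown mode: {mode!r}")


if __name__ == "__main__":
    q_net = {"w": [1.0, 2.0]}
    target_net = {"w": [0.0, 0.0]}
    soft_update(target_net, q_net, tau=0.5)
    print(target_net)
    hard_update(target_net, q_net)
    print(target_net)
```

With a hard update the target stays frozen between copies and then jumps to the current Q-network weights, whereas a soft update moves the target a fraction `tau` of the way toward the Q-network on every step. Setting `tau = 1.0` in the soft update reduces it to a hard update.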