This lesson is for subscribers
You've completed the free preview. Subscribe to unlock every lesson in every course.
Add policy entropy to the loss function to encourage exploration and prevent premature convergence.
You've completed the free preview. Subscribe to unlock every lesson in every course.