This lesson is for subscribers
You've completed the free preview. Subscribe to unlock every lesson in every course.
Adding entropy bonus to encourage exploration by preventing premature convergence to deterministic policies.
You've completed the free preview. Subscribe to unlock every lesson in every course.