This lesson is for subscribers
You've completed the free preview. Subscribe to unlock every lesson in every course.
Understand why combining value estimation with policy learning reduces variance and improves sample efficiency.
You've completed the free preview. Subscribe to unlock every lesson in every course.