Course contentsShow
Machine Learning and Deep Learning
Lesson 2279 of 3,53849. Deep Reinforcement Learning: Policy-BasedPro lesson

Baseline Subtraction and Variance Reduction

See how subtracting a baseline (value function) from returns reduces gradient variance without introducing bias.

This lesson is for subscribers

You've completed the free preview. Subscribe to unlock every lesson in every course.