This lesson is for subscribers
You've completed the free preview. Subscribe to unlock every lesson in every course.
Using KL divergence to measure the distance between old and new policies in probability distribution space.
You've completed the free preview. Subscribe to unlock every lesson in every course.