Reusing collected rollouts across multiple gradient steps while respecting trust region constraints.
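
To make the idea concrete, here is a minimal NumPy sketch of one common way to reuse a fixed batch of rollouts: a PPO-style clipped surrogate objective. This is an assumption, since the summary above does not name a specific algorithm, and clipping only approximates a trust region rather than enforcing one exactly. The two-action softmax policy, the toy advantages, and the helper names (`softmax`, `clipped_surrogate_grad`, `reuse_rollouts`) are hypothetical and chosen for illustration only.

```python
import numpy as np

def softmax(logits):
    # Numerically stable softmax over a 1-D array of logits.
    z = logits - logits.max()
    e = np.exp(z)
    return e / e.sum()

def clipped_surrogate_grad(theta, actions, advantages, old_probs, eps):
    """Gradient of mean(min(r * A, clip(r, 1 - eps, 1 + eps) * A)) w.r.t. the logits."""
    pi = softmax(theta)
    ratio = pi[actions] / old_probs  # r = pi_new(a) / pi_old(a)
    # A sample contributes zero gradient once its ratio has moved past the
    # clip bound in the direction its advantage favors.
    clipped = ((advantages > 0) & (ratio > 1 + eps)) | \
              ((advantages < 0) & (ratio < 1 - eps))
    weights = np.where(clipped, 0.0, ratio * advantages)
    # For a softmax policy, d log pi(a) / d theta = onehot(a) - pi.
    grad_logp = np.eye(len(theta))[actions] - pi
    return (weights[:, None] * grad_logp).mean(axis=0)

def reuse_rollouts(theta, actions, advantages, n_epochs=10, lr=0.1, eps=0.2):
    """Take several gradient-ascent steps on the same batch of rollouts."""
    old_probs = softmax(theta)[actions]  # frozen at data-collection time
    theta = theta.copy()
    for _ in range(n_epochs):
        theta += lr * clipped_surrogate_grad(theta, actions, advantages, old_probs, eps)
    return theta

# Toy batch (hypothetical): action 1 is good, action 0 is bad.
actions = np.arange(8) % 2
advantages = np.where(actions == 1, 1.0, -1.0)
theta = reuse_rollouts(np.zeros(2), actions, advantages, n_epochs=50)
print(softmax(theta))
```

In this toy setup the updates stop by themselves once every sample's probability ratio has left the interval [1 - eps, 1 + eps], so running more epochs on the same batch no longer changes the policy. The last step before clipping can still overshoot the bound slightly, which is one reason the clip is a soft constraint rather than a hard one.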