This lesson is for subscribers
You've completed the free preview. Subscribe to unlock every lesson in every course.
Adapting PPO for different action spaces using Gaussian policies or categorical distributions.
You've completed the free preview. Subscribe to unlock every lesson in every course.