Machine Learning and Deep Learning
Lesson 2140 of 3,538
46. Reinforcement Learning: Fundamentals

Policies: Deterministic vs Stochastic

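A policy represents an agent's behavior as a mapping from states to actions. A deterministic policy π(s) returns a single action for each state, while a stochastic policy π(a|s) gives a probability distribution over actions for each state.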
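To make the two notations concrete, here is a minimal sketch in Python. The states, actions, and probabilities are hypothetical, chosen only for illustration; the function names `act_deterministic` and `act_stochastic` are likewise assumptions and not part of any library.

```python
import random

# Hypothetical toy problem: two states and two actions (illustration only).
STATES = ["low_battery", "high_battery"]
ACTIONS = ["recharge", "explore"]

# Deterministic policy pi(s): each state maps to exactly one action.
DETERMINISTIC_POLICY = {
    "low_battery": "recharge",
    "high_battery": "explore",
}

# Stochastic policy pi(a|s): each state maps to a probability
# distribution over actions (each row sums to 1).
STOCHASTIC_POLICY = {
    "low_battery": {"recharge": 0.9, "explore": 0.1},
    "high_battery": {"recharge": 0.2, "explore": 0.8},
}


def act_deterministic(state):
    """Return the single action pi(s) for the given state."""
    return DETERMINISTIC_POLICY[state]


def act_stochastic(state, rng):
    """Sample an action a ~ pi(.|s) using the supplied random generator."""
    dist = STOCHASTIC_POLICY[state]
    actions = list(dist)
    weights = [dist[a] for a in actions]
    return rng.choices(actions, weights=weights, k=1)[0]


if __name__ == "__main__":
    rng = random.Random(0)  # seeded so repeated runs sample the same sequence
    print(act_deterministic("low_battery"))
    print([act_stochastic("low_battery", rng) for _ in range(5)])
```

Calling `act_deterministic` on the same state always returns the same action, whereas repeated calls to `act_stochastic` can return different actions, with frequencies that approach the listed probabilities. A deterministic policy can be viewed as the special case of a stochastic one that puts probability 1 on a single action in each state.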
