This lesson is for subscribers
You've completed the free preview. Subscribe to unlock every lesson in every course.
Probability matching approach that samples from posterior distributions over action values for exploration.
You've completed the free preview. Subscribe to unlock every lesson in every course.