How pure greedy policies exploit current estimates but fail to discover potentially better actions.
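A minimal sketch of this failure mode, assuming a two-armed bandit with deterministic rewards, zero initial estimates, and sample-average updates. The function name `run_greedy` and the reward values are hypothetical and chosen only for illustration.

```python
def run_greedy(true_rewards, steps):
    """Pure greedy: always pull the arm with the highest current estimate."""
    n_arms = len(true_rewards)
    estimates = [0.0] * n_arms  # initial value estimates (assumed to start at zero)
    counts = [0] * n_arms       # number of pulls per arm

    for _ in range(steps):
        # Exploit only: pick the arm with the highest estimate.
        # max() returns the first maximal item, so ties go to the lowest index.
        arm = max(range(n_arms), key=lambda a: estimates[a])

        # Deterministic reward (an illustrative simplification).
        reward = true_rewards[arm]

        # Incremental sample-average update of the chosen arm's estimate.
        counts[arm] += 1
        estimates[arm] += (reward - estimates[arm]) / counts[arm]

    return estimates, counts


# Arm 1 pays more, but greedy commits to arm 0 after the first pull.
estimates, counts = run_greedy([0.5, 1.0], steps=100)
print(counts)     # [100, 0]
print(estimates)  # [0.5, 0.0]
```

After the first pull, arm 0's estimate (0.5) exceeds arm 1's untouched initial estimate (0.0), so the policy never samples arm 1 and never learns that it pays 1.0.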