This lesson is for subscribers
You've completed the free preview. Subscribe to unlock every lesson in every course.
Understand why policy iteration converges to an optimal policy in a finite number of iterations for finite MDPs.
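As a rough illustration of that objective, here is a minimal sketch of policy iteration on a tiny MDP. The two-state, two-action MDP, the discount factor, and all function names below are assumptions made up for this example, not material from the lesson. The idea it tries to show: each round either strictly improves the policy or leaves it unchanged, and a finite MDP has only finitely many deterministic policies, so the loop has to stop.

```python
# Hypothetical 2-state, 2-action MDP, invented for illustration only.
# P[s][a] is a list of (probability, next_state, reward) triples.
P = {
    0: {0: [(1.0, 0, 0.0)], 1: [(1.0, 1, 1.0)]},
    1: {0: [(1.0, 0, 0.0)], 1: [(1.0, 1, 2.0)]},
}
GAMMA = 0.9  # assumed discount factor


def q_value(V, s, a):
    """Expected return of taking action a in state s, then following V."""
    return sum(p * (r + GAMMA * V[s2]) for p, s2, r in P[s][a])


def evaluate(policy, tol=1e-10):
    """Iterative policy evaluation: sweep until values stop changing."""
    V = [0.0] * len(P)
    while True:
        delta = 0.0
        for s in P:
            v_new = q_value(V, s, policy[s])
            delta = max(delta, abs(v_new - V[s]))
            V[s] = v_new
        if delta < tol:
            return V


def policy_iteration(policy):
    """Alternate evaluation and greedy improvement until the policy is stable."""
    rounds = 0
    while True:
        rounds += 1
        V = evaluate(policy)
        stable = True
        for s in P:
            best = max(P[s], key=lambda a: q_value(V, s, a))
            # Switch only on a strict improvement, so ties cannot cause cycling.
            if q_value(V, s, best) > q_value(V, s, policy[s]) + 1e-8:
                policy[s] = best
                stable = False
        if stable:
            return policy, V, rounds


policy, V, rounds = policy_iteration([0, 0])
print(policy, [round(v, 3) for v in V], rounds)
```

In this sketch there are only 2^2 = 4 deterministic policies, and no policy can be visited twice because every change is a strict improvement, so the loop cannot run for more than 4 rounds. The same counting argument gives the general bound of |A|^|S| rounds for any finite MDP.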