Challenges of reward hacking, proxy objective misalignment, and reward model generalization beyond the training data.