This lesson is for subscribers
You've completed the free preview. Subscribe to unlock every lesson in every course.
How models can exploit reward model weaknesses, leading to high scores but poor actual quality.
You've completed the free preview. Subscribe to unlock every lesson in every course.