This lesson is for subscribers
You've completed the free preview. Subscribe to unlock every lesson in every course.
How alignment properties may degrade under distribution shift, with models reverting to misaligned behaviors in novel contexts.
You've completed the free preview. Subscribe to unlock every lesson in every course.