Course contentsShow
Machine Learning and Deep Learning
Lesson 3434 of 3,53875. LLM Safety and Alignment ChallengesPro lesson

Distributional Shift and Alignment Robustness

How alignment properties may degrade under distribution shift, with models reverting to misaligned behaviors in novel contexts.

This lesson is for subscribers

You've completed the free preview. Subscribe to unlock every lesson in every course.