This lesson is for subscribers
You've completed the free preview. Subscribe to unlock every lesson in every course.
Automatically generating optimized text suffixes that reliably trigger harmful model responses despite safety training.
You've completed the free preview. Subscribe to unlock every lesson in every course.