Course contentsShow
Machine Learning and Deep Learning
Lesson 3270 of 3,53871. Interpretability: Neural Network MethodsPro lesson

Activation Patching and Causal Interventions

Measuring the causal importance of activations by surgically replacing them and observing changes in model behavior.

This lesson is for subscribers

You've completed the free preview. Subscribe to unlock every lesson in every course.