
Toy Models for Mechanistic Analysis

Using small, controllable networks trained on synthetic tasks to develop interpretability techniques and validate hypotheses.
