Course contentsShow
Machine Learning and Deep Learning
Lesson 672 of 3,53816. Activation Functions and Weight InitializationPro lesson

Layer-Specific Initialization

Special initialization for batch norm, layer norm, and residual connections to stabilize training.

This lesson is for subscribers

You've completed the free preview. Subscribe to unlock every lesson in every course.