Machine Learning and Deep Learning
16. Activation Functions and Weight Initialization

He Initialization

Learn how He initialization works for ReLU networks and why it scales weights differently than Xavier initialization.
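
The scaling difference can be illustrated with a small sketch. It assumes the commonly stated formulas, which the lesson summary itself does not spell out: He initialization draws weights with standard deviation sqrt(2 / fan_in), while Xavier (Glorot) normal initialization uses sqrt(2 / (fan_in + fan_out)). The helper names (he_std, xavier_std, signal_after) are hypothetical, chosen only for this illustration, and the network is a plain stack of equal-width ReLU layers without biases.

```python
import math
import random


def he_std(fan_in):
    # Assumed He formula: variance 2 / fan_in, compensating for ReLU
    # zeroing roughly half of the pre-activations.
    return math.sqrt(2.0 / fan_in)


def xavier_std(fan_in, fan_out):
    # Assumed Xavier (Glorot) normal formula: variance 2 / (fan_in + fan_out).
    return math.sqrt(2.0 / (fan_in + fan_out))


def signal_after(depth, width, std, seed=0):
    """Push one random input through `depth` ReLU layers of size `width`
    and return the mean squared activation of the final layer."""
    rng = random.Random(seed)
    x = [rng.gauss(0.0, 1.0) for _ in range(width)]
    for _ in range(depth):
        # Fresh weight matrix per layer, entries drawn from N(0, std^2).
        weights = [[rng.gauss(0.0, std) for _ in range(width)]
                   for _ in range(width)]
        # Linear map followed by ReLU.
        x = [max(0.0, sum(w * xi for w, xi in zip(row, x)))
             for row in weights]
    return sum(a * a for a in x) / len(x)


if __name__ == "__main__":
    depth, width = 8, 128
    he = signal_after(depth, width, he_std(width))
    xavier = signal_after(depth, width, xavier_std(width, width))
    print("He mean square:    ", he)
    print("Xavier mean square:", xavier)
    print("ratio:             ", he / xavier)
```

With equal layer widths the Xavier standard deviation is smaller than the He one by a factor of sqrt(2), so under these assumptions the mean squared activation shrinks by about half per ReLU layer relative to the He-initialized stack. Because both runs use the same seed, the two networks differ only by that scale factor, which makes the comparison direct.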
