Machine Learning and Deep Learning
59. Distributed Training: Data Parallelism

Learning Rate Scaling Rules

This lesson covers the linear scaling rule and the square-root (sqrt) scaling rule for adjusting the learning rate as batch size and worker count increase.
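
As a rough illustration of the two rules named above, the sketch below computes a scaled learning rate from a reference configuration. It assumes the common formulation in which the global batch size is the per-worker batch size times the number of workers, the linear rule multiplies the base learning rate by the batch-size ratio, and the sqrt rule multiplies it by the square root of that ratio. The function name, its parameters, and the example values are hypothetical and do not come from the lesson itself.

```python
import math


def scale_learning_rate(base_lr, base_batch_size, per_worker_batch_size,
                        num_workers, rule="linear"):
    """Return a learning rate adjusted for a larger global batch (illustrative helper)."""
    # Assumption: in data parallelism each worker processes its own mini-batch,
    # so the effective (global) batch is the sum across workers.
    global_batch_size = per_worker_batch_size * num_workers

    # Ratio of the new global batch to the reference batch the base_lr was tuned for.
    k = global_batch_size / base_batch_size

    if rule == "linear":
        # Linear scaling rule: learning rate grows in proportion to batch size.
        return base_lr * k
    if rule == "sqrt":
        # Sqrt scaling rule: learning rate grows with the square root of the ratio.
        return base_lr * math.sqrt(k)
    raise ValueError(f"unknown rule: {rule!r}")


if __name__ == "__main__":
    # Hypothetical setup: base_lr tuned at batch 256, now 32 workers x 32 samples.
    for rule in ("linear", "sqrt"):
        lr = scale_learning_rate(0.1, 256, 32, 32, rule=rule)
        print(rule, lr)
```

Either rule is a starting point rather than a guarantee. In practice the scaled rate is often paired with a warmup period and re-validated, since very large batches can make both rules break down.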
