This lesson is for subscribers
You've completed the free preview. Subscribe to unlock every lesson in every course.
Why gradients must be averaged (not summed) and how this maintains mathematical equivalence to single-GPU training.
You've completed the free preview. Subscribe to unlock every lesson in every course.