Course contentsShow
Machine Learning and Deep Learning
Lesson 2681 of 3,53858. Model Compression: Pruning and DistillationPro lesson

The Distillation Loss Function

Combining distillation loss (teacher-student KL divergence) with ground truth loss for student training.

This lesson is for subscribers

You've completed the free preview. Subscribe to unlock every lesson in every course.