Course contentsShow
Machine Learning and Deep Learning
Lesson 2685 of 3,53858. Model Compression: Pruning and DistillationPro lesson

Attention Transfer and Relational Knowledge

Distilling attention maps and relational structures between features rather than raw activations.

This lesson is for subscribers

You've completed the free preview. Subscribe to unlock every lesson in every course.