Course contentsShow
Machine Learning and Deep Learning
Lesson 1357 of 3,53830. Vision TransformersPro lesson

Patch Merging as Downsampling

How concatenating and projecting neighboring patches reduces spatial dimensions while expanding channels.

This lesson is for subscribers

You've completed the free preview. Subscribe to unlock every lesson in every course.