
Cross-Attention Mechanism

How the decoder attends to encoder outputs: queries come from the decoder, while keys and values come from the encoder.
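
The description above can be sketched as a single-head, scaled dot-product cross-attention in plain Python. This is a minimal illustration rather than the lesson's own implementation: the function names (`cross_attention`, `matmul`, `random_matrix`), the projection matrices `w_q`, `w_k`, `w_v`, and all dimensions are assumptions chosen for the example, and the weights are random rather than learned.

```python
import math
import random


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(m):
    """Swap rows and columns of a matrix."""
    return [list(col) for col in zip(*m)]


def softmax(row):
    """Numerically stable softmax over one row of scores."""
    peak = max(row)
    exps = [math.exp(v - peak) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention(decoder_states, encoder_outputs, w_q, w_k, w_v):
    """Single-head cross-attention (illustrative sketch, no masking)."""
    q = matmul(decoder_states, w_q)    # queries come from the decoder
    k = matmul(encoder_outputs, w_k)   # keys come from the encoder
    v = matmul(encoder_outputs, w_v)   # values come from the encoder
    scale = math.sqrt(len(q[0]))       # sqrt(d_k) scaling
    scores = [[s / scale for s in row] for row in matmul(q, transpose(k))]
    weights = [softmax(row) for row in scores]  # one distribution per decoder position
    return matmul(weights, v), weights


def random_matrix(rows, cols, rng):
    """Hypothetical helper: a matrix of Gaussian samples standing in for real data."""
    return [[rng.gauss(0.0, 1.0) for _ in range(cols)] for _ in range(rows)]


rng = random.Random(0)
d_model, d_k, d_v = 8, 4, 4   # assumed sizes for the example
t_dec, t_enc = 3, 5           # 3 decoder positions, 5 encoder positions

decoder_states = random_matrix(t_dec, d_model, rng)
encoder_outputs = random_matrix(t_enc, d_model, rng)
w_q = random_matrix(d_model, d_k, rng)
w_k = random_matrix(d_model, d_k, rng)
w_v = random_matrix(d_model, d_v, rng)

context, weights = cross_attention(decoder_states, encoder_outputs, w_q, w_k, w_v)
print(len(context), len(context[0]))   # 3 4
print(len(weights), len(weights[0]))   # 3 5
```

The attention weights have one row per decoder position and one column per encoder position, and each row sums to 1, so every decoder position receives a weighted average of the encoder's value vectors. The two sequences may have different lengths, since only the projected query and key dimensions need to match.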
