Machine Learning and Deep Learning
31. Multimodal Models

Attention in VQA: Co-Attention and Bilinear Pooling

Understand co-attention mechanisms that jointly attend over image regions and question words, and bilinear pooling for feature fusion in VQA.
