Course contentsShow
Machine Learning and Deep Learning
Lesson 1409 of 3,53831. Multimodal ModelsPro lesson

Visual Question Answering Task Definition

Understand the VQA task: given an image and a natural language question, predict the correct answer, and learn common VQA dataset characteristics.

This lesson is for subscribers

You've completed the free preview. Subscribe to unlock every lesson in every course.