CLIP's Training Objective

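CLIP is trained with contrastive learning on image-text pairs: within each batch, it maximizes the cosine similarity between the embeddings of matched image-text pairs and minimizes it for every mismatched pairing.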
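
One common way to turn this objective into a loss is a symmetric cross-entropy over temperature-scaled cosine similarities, where the i-th image and i-th text in a batch are treated as the only correct match for each other. The sketch below illustrates that idea in plain Python. The function names, the toy embeddings, and the fixed temperature of 0.07 are assumptions made for illustration, not details taken from this lesson.

```python
import math

def normalize(vec):
    """Scale a vector to unit length so dot products equal cosine similarity."""
    norm = math.sqrt(sum(x * x for x in vec)) or 1.0
    return [x / norm for x in vec]

def log_softmax_at(scores, idx):
    """Log-probability of position idx under a softmax over scores (numerically stable)."""
    top = max(scores)
    log_sum = top + math.log(sum(math.exp(s - top) for s in scores))
    return scores[idx] - log_sum

def contrastive_loss(image_emb, text_emb, temperature=0.07):
    """Symmetric contrastive loss; row i of each input is assumed to be a matched pair.

    temperature is an illustrative fixed value (an assumption of this sketch).
    """
    imgs = [normalize(v) for v in image_emb]
    txts = [normalize(v) for v in text_emb]
    n = len(imgs)

    # logits[i][j] = cosine similarity of image i and text j, divided by temperature
    logits = [
        [sum(a * b for a, b in zip(img, txt)) / temperature for txt in txts]
        for img in imgs
    ]

    # Image-to-text: each image should pick its own text out of the batch.
    loss_i2t = -sum(log_softmax_at(logits[k], k) for k in range(n)) / n

    # Text-to-image: each text should pick its own image (softmax over columns).
    columns = [[logits[r][c] for r in range(n)] for c in range(n)]
    loss_t2i = -sum(log_softmax_at(columns[k], k) for k in range(n)) / n

    return 0.5 * (loss_i2t + loss_t2i)

# Toy batch: three orthogonal "image" embeddings and their "text" counterparts.
eye = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
aligned_loss = contrastive_loss(eye, eye)                  # every pair matches
shuffled_loss = contrastive_loss(eye, eye[1:] + eye[:1])   # every pair mismatched
```

In this toy setup the aligned batch yields a much smaller loss than the shuffled one, which is the behavior the objective rewards: matched pairs score high on the diagonal of the similarity matrix, while mismatched pairs are pushed down.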
