This lesson is for subscribers
You've completed the free preview. Subscribe to unlock every lesson in every course.
Understanding embeddings that encode multiple modalities (text, image, audio) into a shared vector space for cross-modal retrieval.
You've completed the free preview. Subscribe to unlock every lesson in every course.