This lesson is for subscribers
You've completed the free preview. Subscribe to unlock every lesson in every course.
Why BERT uses only the encoder stack and how it differs from decoder-only or encoder-decoder models.
You've completed the free preview. Subscribe to unlock every lesson in every course.