Course contentsShow
Machine Learning and Deep Learning
Lesson 1188 of 3,53827. Pretrained Language Models: GPT Family and BeyondPro lesson

Teacher Forcing in Autoregressive Training

Using ground-truth tokens as input at each step during training to enable parallel computation of losses.

This lesson is for subscribers

You've completed the free preview. Subscribe to unlock every lesson in every course.