Exploring the architectural choice to use only the decoder stack and its implications for autoregressive generation.