Course contentsShow
AI Engineering
Lesson 70 of 1,8862. Working with Pre-trained ModelsPro lesson

Mixed Precision Inference

Using FP16 and BF16 instead of FP32 to reduce memory usage and increase speed.

This lesson is for subscribers

You've completed the free preview. Subscribe to unlock every lesson in every course.