DeepLearning.AI Courses - New course with Hugging Face: Quantization Fundamentals

5.0 (2)

18 learners

What you'll learn

This course includes

5.5 hours of video
Certificate of completion
Access on mobile and TV

Summary

Full Transcript

Enroll now: https://bit.ly/3VUbDMo Introducing a new short course: Quantization Fundamentals with Hugging Face. Generative AI models often exceed the capabilities of consumer-grade hardware and are expensive to run. Compressing models through methods such as quantization makes them more efficient, faster, and accessible, while minimizing performance degradation. Join this course and: - Learn to quantize any open source model with linear quantization using the Quanto library. - Get an overview of how linear quantization is implemented. This form of quantization can be applied to compress any model, including LLMs, vision models, etc. - Apply “downcasting,” another form of quantization, with the Transformers library, which enables you to load models in about half their normal size in the BFloat16 data type. By the end of this course, you’ll have a foundation in quantization techniques and be able to apply them to compress and optimize your own open source models, allowing them to run on a wide variety of devices, including smartphones, personal computers, and edge devices. Learn more: https://bit.ly/3VUbDMo

Continue this lesson in the app

Install CourseHive on Android or iOS to keep learning while you move.

Related Courses

Welcome

DeepLearning.AI Courses - New course with Hugging Face: Quantization Fundamentals

What you'll learn

This course includes

Summary

Full Transcript

Continue this lesson in the app

Related Courses

In-Depth Graphic Design Courses — Satori Graphics

Free Game Design Courses

Confidence Courses

🎓 Free Professional Courses with Certificates | Skills for Career Growth

FAQs