Quantization Fundamentals (Coursera)

Offered by DeepLearning.AI,
Quantization Fundamentals (Coursera)

Generative AI models, like large language models, often exceed the capabilities of consumer-grade hardware and are expensive to run. Compressing models through methods such as quantization makes them more efficient, faster, and accessible. This allows them to run on a wide variety of devices, including smartphones, personal computers, and edge devices, and minimizes performance degradation.

Class Deals by MOOC List - Click here and see Coursera's Active Discounts, Deals, and Promo Codes.

Join this course to:

  1. Quantize any open source model with linear quantization using the Quanto library.
  2. Get an overview of how linear quantization is implemented. This form of quantization can be applied to compress any model, including LLMs, vision models, etc.
  3. Apply “downcasting,” another form of quantization, with the Transformers library, which enables you to load models in about half their normal size in the BFloat16 data type.

By the end of this course, you will have a foundation in quantization techniques and be able to apply them to compress and optimize your own generative AI models, making them more accessible and efficient.

What you'll learn

  • Learn how to compress models with the Hugging Face Transformers library and the Quanto library.
  • Learn about linear quantization, a simple yet effective method for compressing models.
  • Practice quantizing open source multimodal and language models.
Go to Class
MOOC List is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Related Courses

AI Workflow: Business Priorities and Data Ingestion (Coursera) Coursera
IBM

AI Workflow: Business Priorities and Data Ingestion (Coursera)

This is the first course of a six part specialization. You are STRONGLY encouraged to complete these courses in order as they are not individual independent courses, but part of a workflow where each course builds on the previous ones. This first course in the IBM AI Enterprise Workflow Certification specialization introduces you to the scope of the specialization and prerequisites.

Jun 8th 2026
2 Weeks
Learn to code with AI (Coursera) Coursera
Scrimba

Learn to code with AI (Coursera)

Imagine waking up tomorrow as a web developer. What would you want to build? With AI tools like ChatGPT, you're already a developer, regardless of your experience, if you know how to work with them. So in this course, you'll build functional, interactive front-end projects while learning how to write effective prompts and debug and refine your code with the help of AI.

Jun 10th 2026
2 Weeks
Introduction to Deep Learning & Neural Networks with Keras (Coursera) Coursera
IBM

Introduction to Deep Learning & Neural Networks with Keras (Coursera)

Looking to start a career in Deep Learning? Look no further. This course will introduce you to the field of deep learning and help you answer many questions that people are asking nowadays, like what is deep learning, and how do deep learning models compare to artificial neural networks? You will learn about the different deep learning models and build your first deep learning model using the Keras library.

Jun 8th 2026
5-12 Weeks
Ethical Issues in Data Science (Coursera) Coursera
University of Colorado Boulder

Ethical Issues in Data Science (Coursera)

Computing applications involving large amounts of data – the domain of data science – impact the lives of most people in the U.S. and the world. These impacts include recommendations made to us by internet-based systems, information that is available about us online, techniques that are used for security and surveillance, data that is used in health care, and many more. In many cases, they are affected by techniques in artificial intelligence and machine learning.

Jun 8th 2026
5-12 Weeks
Google Cloud Product Fundamentals em Português Brasileiro (Coursera) Coursera
Google Cloud

Google Cloud Product Fundamentals em Português Brasileiro (Coursera)

Este curso é uma continuação do "Business Transformation with Google Cloud" e guiará você pela jornada de transformação de uma organização do ponto de vista tecnológico. Explicaremos como as organizações podem fazer a transformação digital usando a tecnologia do Google Cloud nestas categorias: modernização da infraestrutura de TI; melhorias no processo de desenvolvimento dos aplicativos da empresa; uso do machine learning e da inteligência artificial para criar novo valor; a importância de ferramentas de produtividade como o G Suite na realização do trabalho; e compreender as oportunidades e os desafios da gestão do custo que uma infraestrutura de TI na nuvem traz.

Jun 8th 2026
5-12 Weeks
Deep learning in Electronic Health Records - CDSS 2 (Coursera) Coursera
University of Glasgow

Deep learning in Electronic Health Records - CDSS 2 (Coursera)

Overview of the main principles of Deep Learning along with common architectures. Formulate the problem for time-series classification and apply it to vital signals such as ECG. Applying this methods in Electronic Health Records is challenging due to the missing values and the heterogeneity in EHR, which include both continuous, ordinal and categorical variables. Subsequently, explore imputation techniques and different encoding strategies to address these issues. Apply these approaches to formulate clinical prediction benchmarks derived from information available in MIMIC-III database.

Jun 8th 2026
4 Weeks
AI and the Illusion of Intelligence (Coursera) Coursera
Copenhagen Business School

AI and the Illusion of Intelligence (Coursera)

Will AI soon be surpassing humans? This is rapidly becoming one of the central questions of our time -- but it is the wrong question. In this course, we will provide a non-technical look at where AI has come from, and where it is going. We will see that there is no reason to expect that AI will be surpassing humans. Instead, what we are learning to create with AI is the illusion of intelligence.

Jun 8th 2026
4 Weeks
AI Capstone Project with Deep Learning (Coursera) Coursera
IBM

AI Capstone Project with Deep Learning (Coursera)

In this capstone, learners will apply their deep learning knowledge and expertise to a real world challenge. They will use a library of their choice to develop and test a deep learning model. They will load and pre-process data for a real problem, build the model and validate it. Learners will then present a project report to demonstrate the validity of their model and their proficiency in the field of Deep Learning.

Jun 8th 2026
4 Weeks
Math for AI beginner part 1 Linear Algebra (Coursera) Coursera
Korea Advanced Institute of Science and Technology - KAIST

Math for AI beginner part 1 Linear Algebra (Coursera)

'Learn concept of AI such as machine learning, deep-learning, support vector machine which is related to linear algebra. Learn how to use linear algebra for AI algorithm. After completing this course, you are able to understand AI algorithm and basics of linear algebra for AI applications.

Jun 8th 2026
5-12 Weeks