Coursera

Probability & Statistics for Machine Learning & Data Science (Coursera)

Offered by DeepLearning.AI,

Mathematics for Machine Learning and Data science is a foundational online program created in by DeepLearning.AI and taught by Luis Serrano. This beginner-friendly program is where you’ll master the fundamental mathematics toolkit of machine learning.

Class Deals by MOOC List - Click here and see Coursera's Active Discounts, Deals, and Promo Codes.

After completing this course, learners will be able to:
• Describe and quantify the uncertainty inherent in predictions made by machine learning models, using the concepts of probability, random variables, and probability distributions.
• Visually and intuitively understand the properties of commonly used probability distributions in machine learning and data science like Bernoulli, Binomial, and Gaussian distributions
• Apply common statistical methods like maximum likelihood estimation (MLE) and maximum a priori estimation (MAP) to machine learning problems
• Assess the performance of machine learning models using interval estimates and margin of errors
• Apply concepts of statistical hypothesis testing to commonly used tests in data science like AB testing
Many machine learning engineers and data scientists struggle with mathematics. Challenging interview questions often hold people back from leveling up in their careers, and even experienced practitioners can feel held by a lack of math skills.
This specialization uses innovative pedagogy in mathematics to help you learn quickly and intuitively, with courses that use easy-to-follow plugins and visualizations to help you see how the math behind machine learning actually works. Upon completion, you’ll understand the mathematics behind all the most common algorithms and data analysis techniques — plus the know-how to incorporate them into your machine learning career.
Course 3 of 3 in the Mathematics for Machine Learning and Data Science Specialization.

What You Will Learn

Describe and quantify the uncertainty inherent in predictions made by machine learning models
Visually and intuitively understand the properties of commonly used probability distributions in machine learning and data science
Apply common statistical methods like maximum likelihood estimation (MLE) and maximum a priori estimation (MAP) to machine learning problems
Assess the performance of machine learning models using interval estimates and margin of errors

Syllabus

WEEK 1
Week 1 - Introduction to Probability and Probability Distributions
In this week, you will learn about probability of events and various rules of probability to correctly do arithmetic with probabilities. You will learn the concept of conditional probability and the key idea behind Bayes theorem. In lesson 2, we generalize the concept of probability of events to probability distribution over random variables. You will learn about some common probability distributions like the Binomial distribution and the Normal distribution.

WEEK 2
Week 2 - Describing probability distributions and probability distributions with multiple variables
This week you will learn about different measures to describe probability distributions as well as any dataset. These include the measures of central tendency (mean, median, and mode), variance, skewness, and kurtosis. The concept of the expected value of a random variable is introduced to understand each of these measures. You will also learn about some visual tools to describe data and distributions. In lesson 2, you will learn about the probability distribution of two or more random variables using concepts like joint distribution, marginal distribution, and conditional distribution. You will end the week by learning about covariance: a generalization of variance to two or more random variables.

WEEK 3
Week 3 - Sampling and Point estimation
This week shifts its focus from probability and statistics. You will start by learning the concept of a sample and a population and two fundamental results from statistics that concern samples and population: the law of large numbers and the central limit theorem. In lesson 2, you will learn the first and the simplest method of estimation in statistics: point estimation. You will see how maximum likelihood estimation, the most common point estimation method, works and how it connects with regularization (technique used to reduce overfitting in machine learning) using Bayes theorem.

WEEK 4
Week 4 - Confidence Intervals and Hypothesis testing
This week you will learn another estimation method called interval estimation. The most common interval estimates are confidence intervals and you will see how they are calculated and how to correctly interpret them. In lesson 2, we cover the third estimation method called hypothesis testing where estimates are formulated as hypothesis and then tested in the presence of available evidence or sample of data. You will learn the concept of p-value that helps in making a decision for a hypothesis test and also learn some common tests like the t-test, two-sample t-test, and the paired t-test. We end the week with an interesting application of hypothesis testing in data science: A/B testing.

Go to Class

MOOC List is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Related Courses

Coursera

Johns Hopkins University

Algorithms for DNA Sequencing (Coursera)

Statistics & Data Analysis Data Science

We will learn computational methods -- algorithms and data structures -- for analyzing DNA sequencing data. We will learn a little about DNA, genomics, and how DNA sequencing is used. We will use Python to implement key algorithms and data structures and to analyze real genomes and DNA sequencing datasets.

Jun 22nd 2026

4 Weeks

Python Algorithms DNA

Coursera

University of Washington

Data Manipulation at Scale: Systems and Algorithms (Coursera)

Statistics & Data Analysis Data Science

Data analysis has replaced data acquisition as the bottleneck to evidence-based decision making --- we are drowning in it. Extracting knowledge from large, heterogeneous, and noisy datasets requires not only powerful computing resources, but the programming abstractions to use them effectively. The abstractions that emerged in the last decade blend ideas from parallel databases, distributed systems, and programming languages to create a new class of scalable data analytics platforms that form the foundation for data science at realistic scales.

Jun 22nd 2026

4 Weeks

Algebra Algorithms Databases

Coursera

Scrimba

Learn to code with AI (Coursera)

CS: Software Engineering

Imagine waking up tomorrow as a web developer. What would you want to build? With AI tools like ChatGPT, you're already a developer, regardless of your experience, if you know how to work with them. So in this course, you'll build functional, interactive front-end projects while learning how to write effective prompts and debug and refine your code with the help of AI.

Jun 24th 2026

2 Weeks

Programming Artificial Intelligence HTML

Coursera

University of Washington

Communicating Data Science Results (Coursera)

Statistics & Data Analysis Data Science

Making predictions is not enough! Effective data scientists know how to explain and interpret their results, and communicate findings accurately to stakeholders to inform business decisions. Visualization is the field of research in computer science that studies effective communication of quantitative results by linking perception, cognition, and algorithms to exploit the enormous bandwidth of the human visual cortex. In this course you will learn to recognize, design, and use effective visualizations.

Jun 22nd 2026

3 Weeks

Ethics Cloud Computing Privacy

Coursera

University of Illinois at Urbana-Champaign

Inferential and Predictive Statistics for Business (Coursera)

Management & Leadership Statistics & Data Analysis

This course provides an analytical framework to help you evaluate key problems in a structured fashion and will equip you with tools to better manage the uncertainties that pervade and complicate business processes. The course aim to cover statistical ideas that apply to managers. We will consider two basic themes: first, is recognizing and describing variations present in everything around us, and then modeling and making decisions in the presence of these variations.

Jun 22nd 2026

4 Weeks

Business Statistical Inference Regression Models

Coursera

Johns Hopkins University

Exploratory Data Analysis (Coursera)

Statistics & Data Analysis Data Science

This course covers the essential exploratory techniques for summarizing data. These techniques are typically applied before formal modeling commences and can help inform the development of more complex statistical models. Exploratory techniques are also important for eliminating or sharpening potential hypotheses about the world that can be addressed by the data.

Jun 22nd 2026

4 Weeks

Statistics Data Analysis Data Science

Coursera

Johns Hopkins University

Reproducible Research (Coursera)

Statistics & Data Analysis Data Science

This course focuses on the concepts and tools behind reporting modern data analyses in a reproducible manner. Reproducible research is the idea that data analyses, and more generally, scientific claims, are published with their data and software code so that others may verify the findings and build upon them. The need for reproducibility is increasing dramatically as data analyses become more complex, involving larger datasets and more sophisticated computations.

Jun 22nd 2026

4 Weeks

Data Analysis Data Science Reproducible Research

Coursera

Universidad Nacional Autónoma de México

Introducción a Data Science: Programación Estadística con R (Coursera)

Statistics & Data Analysis Data Science

Este curso te proporcionará las bases del lenguaje de programación estadística R, la lengua franca de la estadística, el cual te permitirá escribir programas que lean, manipulen y analicen datos cuantitativos. Te explicaremos la instalación del lenguaje; también verás una introducción a los sistemas base de gráficos y al paquete para graficar ggplot2, para visualizar estos datos. Además también abordarás la utilización de uno de los IDEs más populares entre la comunidad de usuarios de R, llamado RStudio.

Jun 22nd 2026

4 Weeks

Data Analysis Data Science R Language

Coursera

Johns Hopkins University

Developing Data Products (Coursera)

Statistics & Data Analysis Data Science

A data product is the production output from a statistical analysis. Data products automate complex analysis tasks or use technology to expand the utility of a data informed model, algorithm or inference. This course covers the basics of creating data products using Shiny, R packages, and interactive graphics.

Jun 22nd 2026

4 Weeks

Statistics Data Products Statistical Analysis

Coursera

DeepLearning.AI

AI For Everyone (Coursera)

Business

AI is not only for engineers. If you want your organization to become better at using AI, this is the course to tell everyone--especially your non-technical colleagues--to take.

Jun 25th 2026

4 Weeks

Artificial Intelligence Machine Learning Neural Networks

Coursera

University of Washington

Machine Learning: Classification (Coursera)

Statistics & Data Analysis Data Science

Case Studies: Analyzing Sentiment & Loan Default Prediction. In our case study on analyzing sentiment, you will create models that predict a class (positive/negative sentiment) from input features (text of the reviews, user profile information,...). In our second case study for this course, loan default prediction, you will tackle financial data, and predict when a loan is likely to be risky or safe for the bank.

Jun 22nd 2026

5-12 Weeks

Python Machine Learning Classification

Coursera

McMaster University

Experimentation for Improvement (Coursera)

Statistics & Data Analysis Data Science

We are always using experiments to improve our lives, our community, and our work. Are you doing it efficiently? Or are you (incorrectly) changing one thing at a time and hoping for the best? In this course, you will learn how to plan efficient experiments - testing with many variables. Our goal is to find the best results using only a few experiments. A key part of the course is how to optimize a system.

Jun 22nd 2026

5-12 Weeks

Statistics Data Science Regression Models