EdX

Statistical Thinking for Data Science and Analytics (edX)

Statistical Thinking for Data Science and Analytics (edX)

Learn how statistics plays a central role in the data science approach. This statistics and data analysis course will pave the statistical foundation for our discussion on data science. You will learn how data scientists exercise statistical thinking in designing data collection, derive insights from visualizing data, obtain supporting evidence for data-based decisions and construct models for predicting future trends from data.

Class Deals by MOOC List - Click here and see EdX's Active Discounts, Deals, and Promo Codes.

What you'll learn:

  • Data collection, analysis and inference
  • Data classification to identify key traits and customers
  • Conditional Probability-How to judge the probability of an event, based on certain conditions
  • How to use Bayesian modeling and inference for forecasting and studying public opinion
  • Basics of Linear Regression
  • Data Visualization: How to create use data to create compelling graphics

This course is part of the Data Science for Executives Professional Certificate.

Course Syllabus

Week 1 – Introduction to Data Science

Week 2 – Statistical Thinking

  • Examples of Statistical Thinking
  • Numerical Data, Summary Statistics
  • From Population to Sampled Data
  • Different Types of Biases
  • Introduction to Probability
  • Introduction to Statistical Inference

Week 3 – Statistical Thinking 2

  • Association and Dependence
  • Association and Causation
  • Conditional Probability and Bayes Rule
  • Simpsons Paradox, Confounding
  • Introduction to Linear Regression
  • Special Regression Models

Week 4 – Exploratory Data Analysis and Visualization
Goals of statistical graphics and data visualization
Graphs of Data
Graphs of Fitted Models
Graphs to Check Fitted Models
What makes a good graph?
Principles of graphics

Week 5 – Introduction to Bayesian Modeling
Bayesian inference: combining models and data in a forecasting problem
Bayesian hierarchical modeling for studying public opinion
Bayesian modeling for Big Data

Go to Class
MOOC List is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Related Courses

Probability - The Science of Uncertainty and Data (edX) EdX
MIT,MITx

Probability - The Science of Uncertainty and Data (edX)

Build foundational knowledge of data science with this introduction to probabilistic models, including random processes and the basic elements of statistical inference. The world is full of uncertainty: accidents, storms, unruly financial markets, noisy communications. The world is also full of data. Probabilistic modeling and the related field of statistical inference are the keys to analyzing data and making scientifically sound predictions.

Jan 29th 2024
13-24 Weeks
I "Heart" Stats: Learning to Love Statistics (edX) EdX
University of Notre Dame,NotreDameX

I "Heart" Stats: Learning to Love Statistics (edX)

Is your relationship with statistics dysfunctional? We can help: Get to know stats, build a healthy bond, and maybe even fall in love! When you meet a new person, it is hard to know what to expect. You may not be able to read the person or understand what they mean. Even if you want to have a good relationship with them, this lack of understanding can make interactions tense, unpredictable and scary!

No sessions available
4 Weeks
Data Visualization & Cloud Technologies (edX) EdX
University of Wisconsin–Madison,WisconsinX

Data Visualization & Cloud Technologies (edX)

Learn to use data visualization and cloud technologies for business analytics. In this course, gain experience in data visualization and cloud technologies to support business analytics. In the first half of the course, create and share compelling data visualizations to enhance decision-making. In the second half of the course, use cloud technologies to build scalable data warehouses, analyze big data, and develop and deploy machine learning models.

Mar 18th 2024
5-12 Weeks
Quantitative Biology Workshop (edX) EdX
MIT,MITx

Quantitative Biology Workshop (edX)

A workshop-style introduction to tools used in biological research. Discover how to analyze data using computational methods. Do you have an interest in biology and quantitative tools? Do you know computational methods but do not realize how they apply to biological problems? Do you know biology but do not understand how scientists really analyze complicated data? 7.QBWx: Quantitative Biology Workshop is designed to give learners exposure to the application of quantitative tools to analyze biological data at an introductory level.

Self Paced
Self-Paced
Introduction to Linear Models and Matrix Algebra (edX) EdX
HarvardX,Harvard University

Introduction to Linear Models and Matrix Algebra (edX)

Learn to use R programming to apply linear models to analyze data in life sciences. Matrix Algebra underlies many of the current tools for experimental design and the analysis of high-dimensional data. In this introductory data analysis course, we will use matrix algebra to represent the linear models that commonly used to model differences between experimental units. We perform statistical inference on these differences. Throughout the course we will use the R programming language.

Self Paced
Self-Paced
Supply Chain Analytics (edX) EdX
MIT,MITx

Supply Chain Analytics (edX)

Master and apply the core methodologies used in supply chain analysis and modeling, including statistics, regression, optimization and probability – part of the MITx Supply Chain Management MicroMasters Credential. Supply chains are complex systems involving multiple businesses and organizations with different goals and objectives. Many different analytical methods and techniques are used by researchers and practitioners alike to better design and manage their supply chains.

Jan 10th 2024
13-24 Weeks
Data Analysis: Statistical Modeling and Computation in Applications (edX) EdX
MIT,MITx

Data Analysis: Statistical Modeling and Computation in Applications (edX)

A hands-on introduction to the interplay between statistics and computation for the analysis of real data. -- Part of the MITx MicroMasters program in Statistics and Data Science. Data science requires multi-disciplinary skills ranging from mathematics, statistics, machine learning, problem solving to programming, visualization, and communication skills. In this course, learners will combine these foundational and practical skills with domain knowledge to ask and answer questions using real data.

May 13th 2024
13-24 Weeks
Computing for Data Analysis (edX) EdX
Georgia Institute of Technology,GTx

Computing for Data Analysis (edX)

A hands-on introduction to basic programming principles and practice relevant to modern data analysis, data mining, and machine learning. The modern data analysis pipeline involves collection, preprocessing, storage, analysis, and interactive visualization of data. In the course, you’ll see how computing and mathematics come together.

Aug 19th 2024
13-24 Weeks
Mathematical Methods for Quantitative Finance (edX) EdX
MIT,MITx

Mathematical Methods for Quantitative Finance (edX)

Learn the mathematical foundations essential for financial engineering and quantitative finance: linear algebra, optimization, probability, stochastic processes, statistics, and applied computational techniques in R. Modern finance is the science of decision making in an uncertain world, and its language is mathematics. As part of the MicroMasters® Program in Finance, this course develops the tools needed to describe financial markets, make predictions in the face of uncertainty, and find optimal solutions to business and investment decisions.

Jun 26th 2024
5-12 Weeks
Statistics Using Python (edX) EdX
University of Wisconsin–Madison,WisconsinX

Statistics Using Python (edX)

Learn the fundamentals of statistics using Python. This course is a compact primer in statistics as a foundation for data-driven business analysis. A selection of concepts include descriptive statistics, probability, inference, correlation, and regression. The course also exposes students to basic Python programming for use in statistics.

Jan 23rd 2024
5-12 Weeks