Introduction to Bayesian Data Analysis (openHPI)

Offered by Hasso-Plattner-Institut

Bayesian data analysis is increasingly becoming the tool of choice for many data-analysis problems. This free course on Bayesian data analysis will teach you basic ideas about random variables and probability distributions, Bayes' rule, and its application in simple data analysis problems. You will learn to use the R package brms (which is a front-end for the probabilistic programming language Stan). The focus will be on regression modeling, culminating in a brief introduction to hierarchical models (otherwise known as mixed or multilevel models). This course is appropriate for anyone familiar with the programming language R and for anyone who has done some frequentist data analysis (e.g., linear modeling and/or linear mixed modeling) in the past.

Introduction: Why are Bayesian methods important for data analysts?
Here are some of the advantages of Bayesian methods over the standard frequentist approach used in data analysis:

Prior knowledge/expertise can be incorporated into the data analysis
Models can be flexibly specified to reflect the assumed generative process
The results of the analysis – the posterior distributions of the parameters of interest – have an intuitive interpretation
Hypothesis testing can be carried out in a more meaningful manner than with the commonly used null hypothesis significance testing

Prerequisites: Who is this course for?
We assume the following in this course:

Basic familiarity with the programming language R (openHPI offers a free R course for beginners, in German)
Experience with data analysis using linear models
It is helpful (but not necessary) to have had some exposure to linear mixed models using the R package lme4
High-school mathematics (pre-calculus)
Some basic concepts from probability theory (sum and product rule, conditional probability)

This course is not appropriate for participants who do not know R or who have no experience at all with data analysis.

Course outcomes: What will you learn from this course?

Some basic ideas relating to random variables
Some fundamental properties of probability distributions
Application of Bayes' rule in data analysis
The concept of likelihood and its role in Bayesian statistical modeling
Bayesian regression models using brms (a front-end for Stan)
How to visualize and interpret prior and posterior distributions
How to generate prior and posterior predictive distributions for evaluating models
How to interpret the results of simple regression models

After completing this course, you will be in a good position to learn how to use more advanced Bayesian methods, such as hierarchical models, finite mixture models, multinomial processing tree models, measurement error models, etc.

What you'll learn

Bayesian statistics
Data analysis
Bayesian regression models using brms

Course contents

Week 0 - Initial Setup:
Install R and RStudio, then rstan, brms, and the other necessary R packages; set up R Markdown for reproducible data analyses.

Week 1 - Introduction:
Learn the foundational ideas about random variables and probability distributions; Reading: Chapter 1 of the textbook (excluding the section on bivariate distributions).

Week 2 - Bayesian data analysis:
Understand Bayes' rule and use it to derive the posterior; visualize the prior, likelihood, and posterior and see how they relate to one another; incorporate prior knowledge into the analysis; Reading: Chapter 2.
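As a rough illustration of the Week 2 ideas, the sketch below applies Bayes' rule on a grid for a binomial likelihood with a Beta prior, then checks the result against the known conjugate answer. It is written in plain Python rather than the R/brms used in the course. The data (8 successes in 10 trials), the Beta(2, 2) prior, and the function names are made-up assumptions, not taken from the course materials.

```python
from math import comb


def grid_posterior(successes, trials, prior_a, prior_b, n_grid=1001):
    """Approximate the posterior of a binomial probability theta on a grid."""
    # Midpoints of n_grid equal-width bins on (0, 1); avoids theta = 0 or 1.
    thetas = [(i + 0.5) / n_grid for i in range(n_grid)]
    # Unnormalized Beta(prior_a, prior_b) prior density at each grid point.
    prior = [t ** (prior_a - 1) * (1 - t) ** (prior_b - 1) for t in thetas]
    # Binomial likelihood of the observed data at each grid point.
    likelihood = [
        comb(trials, successes) * t ** successes * (1 - t) ** (trials - successes)
        for t in thetas
    ]
    # Bayes' rule: the posterior is proportional to likelihood times prior.
    unnormalized = [lik * pri for lik, pri in zip(likelihood, prior)]
    total = sum(unnormalized)
    posterior = [u / total for u in unnormalized]  # grid probabilities, sum to 1
    return thetas, posterior


def posterior_mean(thetas, posterior):
    """Mean of theta under the grid posterior."""
    return sum(t * p for t, p in zip(thetas, posterior))


# Hypothetical data and prior, chosen only for illustration.
thetas, post = grid_posterior(successes=8, trials=10, prior_a=2, prior_b=2)
grid_mean = posterior_mean(thetas, post)

# Conjugate check: a Beta(a, b) prior with k successes in n trials gives a
# Beta(a + k, b + n - k) posterior, whose mean is (a + k) / (a + b + n).
analytic_mean = (2 + 8) / (2 + 2 + 10)

print(round(grid_mean, 3), round(analytic_mean, 3))
```

The grid version makes the three ingredients explicit as separate lists, which is convenient for plotting the prior, likelihood, and posterior side by side.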

Week 3 - Computational Bayesian data analysis:
Derive the posterior through sampling; fit a simple regression model to data from a button-pressing task using Stan/brms; generate prior predictive distributions, carry out sensitivity analyses, and compare different classes of priors; generate posterior predictive distributions; derive the log-normal likelihood; Reading: Chapter 3.
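To give a feel for what "deriving the posterior through sampling" means, here is a minimal sketch of a random-walk Metropolis sampler in plain Python. This is not what Stan does internally (Stan uses Hamiltonian Monte Carlo); it is a simpler technique swapped in for illustration. The binomial data (8 successes in 10 trials), the Beta(2, 2) prior, the step size, and all function names are assumptions made up for this example.

```python
import math
import random


def log_unnorm_posterior(theta, successes, trials, prior_a, prior_b):
    """Log of likelihood times prior, up to an additive constant."""
    if not 0.0 < theta < 1.0:
        return -math.inf  # zero posterior density outside (0, 1)
    log_lik = successes * math.log(theta) + (trials - successes) * math.log(1 - theta)
    log_prior = (prior_a - 1) * math.log(theta) + (prior_b - 1) * math.log(1 - theta)
    return log_lik + log_prior


def metropolis(n_samples, successes, trials, prior_a, prior_b, step=0.2, seed=1):
    """Draw correlated samples from the posterior with random-walk Metropolis."""
    rng = random.Random(seed)
    theta = 0.5  # arbitrary starting value inside (0, 1)
    current = log_unnorm_posterior(theta, successes, trials, prior_a, prior_b)
    samples = []
    for _ in range(n_samples):
        # Propose a small symmetric jump around the current value.
        proposal = theta + rng.gauss(0.0, step)
        candidate = log_unnorm_posterior(proposal, successes, trials, prior_a, prior_b)
        # Accept with probability min(1, posterior ratio); 1 - random() lies
        # in (0, 1], so the logarithm is always defined.
        if math.log(1.0 - rng.random()) < candidate - current:
            theta, current = proposal, candidate
        samples.append(theta)
    return samples


# Hypothetical data and prior, chosen only for illustration.
draws = metropolis(n_samples=20000, successes=8, trials=10, prior_a=2, prior_b=2)
kept = draws[2000:]  # discard early draws as warm-up
sample_mean = sum(kept) / len(kept)

# For this conjugate model the exact posterior mean is (2 + 8) / (2 + 2 + 10).
print(round(sample_mean, 2), round(10 / 14, 2))
```

Because this model has a known analytic posterior, the sample mean can be compared against the exact value; for the regression models in the course no such closed form is generally available, which is why sampling is used.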

Week 4 - Bayesian regression and hierarchical models:
Perform simple linear regressions using the normal and binomial likelihoods to answer the following research questions: (i) Does attentional load affect pupil size? (ii) Does trial ID affect response times? (iii) Does set size affect recall accuracy? Take a brief look ahead at linear mixed models; Reading: Chapter 4 and Chapter 5 up to Section 5.3.

Final Exam
