Introduction to Statistics & Data Analysis in Public Health (Coursera)

Introduction to Statistics & Data Analysis in Public Health (Coursera)

This course will teach you the core building blocks of statistical analysis - types of variables, common distributions, hypothesis testing - but, more than that, it will enable you to take a data set you've never seen before, describe its keys features, get to know its strengths and quirks, run some vital basic analyses and then formulate and test hypotheses based on means and proportions. You'll then have a solid grounding to move on to more sophisticated analysis and take the other courses in the series.

Class Deals by MOOC List - Click here and see Coursera's Active Discounts, Deals, and Promo Codes.

You'll learn the popular, flexible and completely free software R, used by statistics and machine learning practitioners everywhere. It's hands-on, so you'll first learn about how to phrase a testable hypothesis via examples of medical research as reported by the media. Then you'll work through a data set on fruit and vegetable eating habits: data that are realistically messy, because that's what public health data sets are like in reality. There will be mini-quizzes with feedback along the way to check your understanding. The course will sharpen your ability to think critically and not take things for granted: in this age of uncontrolled algorithms and fake news, these skills are more important than ever.
Course 1 of 4 in the Statistical Analysis with R for Public Health Specialization.

Prerequisites
Some formulae are given to aid understanding, but this is not one of those courses where you need a mathematics degree to follow it. You will need only basic numeracy (for example, we will not use calculus) and familiarity with graphical and tabular ways of presenting results. No knowledge of R or programming is assumed.

What You Will Learn

  • Defend the critical role of statistics in modern public health research and practice
  • Describe a data set from scratch, including data item features and data quality issues, using descriptive statistics and graphical methods in R
  • Select and apply appropriate methods to formulate and examine statistical associations between variables within a data set in R
  • Interpret the output from your analysis and appraise the role of chance and bias

Syllabus

WEEK 1
Introduction to Statistics in Public Health
Statistics has played a critical role of in public health research and practice, and you’ll start by looking at two examples: one from eighteenth century London and the other by the United Nations. The first task in carrying out a research study is to define the research question and express it as a testable hypothesis. With examples from the media, you’ll see what does and does not work in this regard, giving you a chance to define a research question from some real news stories.

WEEK 2
Types of Variables, Common Distributions and Sampling
This module will introduce you to some of the key building blocks of knowledge in statistical analysis: types of variables, common distributions and sampling. You’ll see the difference between “well-behaved” data distributions, such as the normal and the Poisson, and real-world ones that are common in public health data sets.

WEEK 3
Introduction to R and RStudio
Now it’s time to get started with the powerful and completely free statistical software R and its popular interface RStudio. With the example of fruit and vegetable consumption, you’ll learn how to download R, import the data set and run essential descriptive analyses to get to know the variables.

WEEK 4
Hypothesis Testing in R
Having learned how to define a research question and testable hypothesis earlier in the course, you’ll learn how to apply hypothesis testing in R and interpret the result. As all medical knowledge is derived from a sample of patients, random and other kinds of variation mean that what you measure on that sample, such as the average body mass index, is not necessarily the same as in the population as a whole. It’s essential that you incorporate this uncertainty in your estimate of average BMI when presenting it. This involves the calculation of a p value and confidence interval, fundamental concepts in statistical analysis. You’ll see how to do this for averages and proportions.

Go to Class
MOOC List is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Related Courses

Pattern Discovery in Data Mining (Coursera) Coursera
University of Illinois at Urbana-Champaign

Pattern Discovery in Data Mining (Coursera)

Learn the general concepts of data mining along with basic methodologies and applications. Then dive into one subfield in data mining: pattern discovery. Learn in-depth concepts, methods, and applications of pattern discovery in data mining. We will also introduce methods for data-driven phrase mining and some interesting applications of pattern discovery. This course provides you the opportunity to learn skills and content to practice and engage in scalable pattern discovery methods on massive transactional data, discuss pattern evaluation measures, and study methods for mining diverse kinds of patterns, sequential patterns, and sub-graph patterns.

Jun 22nd 2026
4 Weeks
Regression Models (Coursera) Coursera
Johns Hopkins University

Regression Models (Coursera)

Linear models, as their name implies, relates an outcome to a set of predictors of interest using linear assumptions. Regression models, a subset of linear models, are the most important statistical analysis tool in a data scientist’s toolkit. This course covers regression analysis, least squares and inference using regression models.

Jun 22nd 2026
4 Weeks
Preventing Chronic Pain: A Human Systems Approach (Coursera) Coursera
University of Minnesota

Preventing Chronic Pain: A Human Systems Approach (Coursera)

Chronic pain is at epidemic levels and has become the highest-cost condition in health care. This course uses both creative and experiential learning to better understand chronic pain conditions and how they can be prevented through self-management in our cognitive, behavioral, physical, emotional, spiritual, social, and environmental realms.

Jun 22nd 2026
5-12 Weeks
Leadership Through Marketing (Coursera) Coursera
Northwestern University

Leadership Through Marketing (Coursera)

The success of every organization depends on attracting and retaining customers. Although the marketing concepts for doing so are well established, digital technology has empowered customers, while producing massive amounts of data, revolutionizing the processes through which organizations attract and retain customers. In this course, students will learn how to identify new opportunities to create value for empowered consumers, develop strategies that yield an advantage over rivals, and develop the data science skills to lead more effectively, allocate resources, and to confront this very challenging environment with confidence.

Jun 28th 2026
4 Weeks
Primeros Auxilios Psicológicos (PAP) (Coursera) Coursera
Universitat Autònoma de Barcelona

Primeros Auxilios Psicológicos (PAP) (Coursera)

Este curso on-demand (ABIERTO, se puede cursar en cualquier momento), impartido en castellano por la Universidad Autónoma de Barcelona y el Centro de Crisis de Barcelona, está destinado a entrenar en la aplicación de primeros auxilios psicológicos (PAP) a personas afectadas por situaciones altamente estresantes, abarcando tanto emergencias cotidianas (incidentes críticos estadísticamente frecuentes que afectan de manera muy intensa: un accidente de tráfico, una hospitalización, una agresión o la muerte traumática o repentina de una persona, etc.) como emergencias comunitarias y/o masivas (sucesos infrecuentes, que afectan a muchas personas o a una comunidad entera y que sobrepasan con mucho lo que sucede habitualmente en ella: una catástrofe natural, un accidente ferroviario o aéreo o un atentado).

Jun 22nd 2026
5-12 Weeks
Reproducible Research (Coursera) Coursera
Johns Hopkins University

Reproducible Research (Coursera)

This course focuses on the concepts and tools behind reporting modern data analyses in a reproducible manner. Reproducible research is the idea that data analyses, and more generally, scientific claims, are published with their data and software code so that others may verify the findings and build upon them. The need for reproducibility is increasing dramatically as data analyses become more complex, involving larger datasets and more sophisticated computations.

Jun 22nd 2026
4 Weeks
Exploratory Data Analysis (Coursera) Coursera
Johns Hopkins University

Exploratory Data Analysis (Coursera)

This course covers the essential exploratory techniques for summarizing data. These techniques are typically applied before formal modeling commences and can help inform the development of more complex statistical models. Exploratory techniques are also important for eliminating or sharpening potential hypotheses about the world that can be addressed by the data.

Jun 22nd 2026
4 Weeks
Effective Problem-Solving and Decision-Making (Coursera) Coursera
University of California, Irvine

Effective Problem-Solving and Decision-Making (Coursera)

Critical thinking – the application of scientific methods and logical reasoning to problems and decisions – is the foundation of effective problem solving and decision making. Critical thinking enables us to avoid common obstacles, test our beliefs and assumptions, and correct distortions in our thought processes. Gain confidence in assessing problems accurately, evaluating alternative solutions, and anticipating likely risks. Learn how to use analysis, synthesis, and positive inquiry to address individual and organizational problems and develop the critical thinking skills needed in today’s turbulent times. Using case studies and situations encountered by class members, explore successful models and proven methods that are readily transferable on-the-job.

Jun 22nd 2026
4 Weeks
Practical Machine Learning (Coursera) Coursera
Johns Hopkins University

Practical Machine Learning (Coursera)

One of the most common tasks performed by data scientists and data analysts are prediction and machine learning. This course will cover the basic components of building and applying prediction functions with an emphasis on practical applications. The course will provide basic grounding in concepts such as training and tests sets, overfitting, and error rates.

Jun 22nd 2026
4 Weeks
Diabetes - a Global Challenge (Coursera) Coursera
University of Copenhagen

Diabetes - a Global Challenge (Coursera)

Diabetes and obesity are growing health problems in rich and poor countries alike. With this course you will get updated on cutting-edge diabetes and obesity research including biological, genetic and clinical aspects as well as prevention and epidemiology of diabetes and obesity. All lectures are provided by high-profile scientists from one the world's leading universities in diabetes research.

Jun 22nd 2026
5-12 Weeks
Experimentation for Improvement (Coursera) Coursera
McMaster University

Experimentation for Improvement (Coursera)

We are always using experiments to improve our lives, our community, and our work. Are you doing it efficiently? Or are you (incorrectly) changing one thing at a time and hoping for the best? In this course, you will learn how to plan efficient experiments - testing with many variables. Our goal is to find the best results using only a few experiments. A key part of the course is how to optimize a system.

Jun 22nd 2026
5-12 Weeks