Udacity

Data Analysis with R (Udacity)

Offered by Udacity, Facebook,

Visually Analyze and Summarize Data Sets. Exploratory data analysis is an approach for summarizing and visualizing the important characteristics of a data set. Promoted by John Tukey, exploratory data analysis focuses on exploring data to understand the data’s underlying structure and variables, to develop intuition about the data set, to consider how that data set came into existence, and to decide how it can be investigated with more formal statistical methods.

Class Deals by MOOC List - Click here and see Udacity's Active Discounts, Deals, and Promo Codes.

If you're interested in supplemental reading material for the course check out the Exploratory Data Analysis book. (Not Required)
This course is also a part of our Data Analyst Nanodegree.
Udacity's Intro to Programming is your first step towards careers in Web and App Development, Machine Learning, Data Science, AI, and more! This program is perfect for beginners.

What You Will Learn

Lesson 1
What is EDA?

Start by learn about what exploratory data analysis (EDA) is and why it is important.

Lesson 2
R Basics

EDA
which comes before formal hypothesis testing and modeling
makes use of visual methods to analyze and summarize data sets.
R will be our tool for generating those visuals and conducting analyses.
We will install RStudio and packages
learn the layout and basic commands of R
practice writing basic R scripts
and inspect data sets.

Lesson 3
Explore One Variable

Perform EDA to understand the distribution of a variable and to check for anomalies and outliers.
Learn how to quantify and visualize individual variables within a data set to make sense of a pseudo-data set of Facebook users.
Create histograms and boxplots
transform variables
and examine tradeoffs in visualizations.

Lesson 4
Explore Two Variables

DA allows us to identify the most important variables and relationships within a data set before building predictive models.
Learn techniques for exploring the relationship between any two variables in a data set.
Create scatter plots
calculate correlations
and investigate conditional means.

Lesson 5
Explore Many Variables

Learn powerful methods and visualizations for examining relationships among multiple variables.
Reshape data frames and how to use aesthetics like color and shape to uncover more information
Continue to build intuition around the Facebook data set and explore some new data sets as well.

Lesson 6
Diamonds and Price Predictions

Investigate the diamonds data set alongside Facebook Data Scientist
Solomon Messing.
See how predictive modeling can allow us to determine a good price for a diamond.
As a final project

Prerequisites and Requirements
A background in statistics is helpful but not required. Consider taking Intro to Descriptive Statistics prior to taking this course. Relevant topics include:

Mean, median, mode
Normal, uniform, and skewed distributions
Histograms and box plots
Familiarity with the following CS and Math topics will help students:
Variable assignment
Comparison and logical operators ( , =, ==, &, | )
If else statements
Square roots, logarithms, and exponentials

Why Take This Course
You will...

Understand data analysis via EDA as a journey and a way to explore data
Explore data at multiple levels using appropriate visualizations
Acquire statistical knowledge for summarizing data
Demonstrate curiosity and skepticism when performing data analysis
Develop intuition around a data set and understand how the data was generated.

Go to Class

MOOC List is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Related Courses

Coursera

The Chinese University of Hong Kong

Structural Equation Model and its Applications | 结构方程模型及其应用 (普通话) (Coursera)

Statistics & Data Analysis

在社会学、心理学、教育学、经济学、管理学、市场学等研究领域的数据分析中，结构方程建模是当前最前沿的统计方法中应用最广、研究最多的一个。它包含了方差分析、回归分析、路径分析和因子分析，弥补了传统回归分析和因子分析的不足，可以分析多因多果的联系、潜变量的关系，

Aug 17th 2026

5-12 Weeks

Data Analysis LISREL Regression Analysis

Coursera

Universidad Austral

Fundamentos de Excel para Negocios (Coursera)

Statistics & Data Analysis Data Science

Cuando finalices este curso habrás logrado un gran número de habilidades como introducir información, ordenarla, manipularla, realizar cálculos de diversa índole (matemáticos, trigonométricos, estadísticos, financieros, ingenieriles, probabilísticos), extraer conclusiones, trabajar con fechas y horas, construir gráficos, imprimir reportes y muchas más.

Aug 17th 2026

5-12 Weeks

Business Excel Data Analysis

Udacity

Intro to Data Science (Udacity)

Statistics & Data Analysis Data Science

Learn what it takes to become a data scientist. The Introduction to Data Science class will survey the foundational topics in data science, namely: Data Manipulation; Data Analysis with Statistics and Machine Learning; Data Communication with Information Visualization; Data at Scale -- Working with Big Data.

Self Paced

Self-Paced

Statistics Machine Learning Big Data

Udacity

Intro to Data Analysis (Udacity)

Statistics & Data Analysis Data Science

Explore a variety of datasets, posing and answering your own questions about each. You'll be using the Python libraries NumPy, Pandas, and Matplotlib. This course will introduce you to the world of data analysis.

Self Paced

Self-Paced

Python Data Analysis Data Science

EdX

Georgia Institute of Technology,GTx

Computing for Data Analysis (edX)

CS: Software Engineering Statistics & Data Analysis

A hands-on introduction to basic programming principles and practice relevant to modern data analysis, data mining, and machine learning. The modern data analysis pipeline involves collection, preprocessing, storage, analysis, and interactive visualization of data. In the course, you’ll see how computing and mathematics come together.

Aug 24th 2026

13-24 Weeks

Programming Python Computing

EdX

MIT,MITx

Fundamentals of Statistics (edX)

Statistics & Data Analysis

Develop a deep understanding of the principles that underpin statistical inference: estimation, hypothesis testing and prediction. Statistics is the science of turning data into insights and ultimately decisions. Behind recent advances in machine learning, data science and artificial intelligence are fundamental statistical principles. The purpose of this class is to develop and understand these core ideas on firm mathematical grounds starting from the construction of estimators and tests, as well as an analysis of their asymptotic performance.

Aug 26th 2026

13-24 Weeks

Statistics Linear Models Confidence Intervals

Coursera

Rice University

Business Statistics and Analysis Capstone (Coursera)

Statistics & Data Analysis Data Science

The Business Statistics and Analysis Capstone is an opportunity to apply various skills developed across the four courses in the specialization to a real life data. The Capstone, in collaboration with an industry partner uses publicly available ‘Housing Data’ to pose various questions typically a client would pose to a data analyst. Your job is to do the relevant statistical analysis and report your findings in response to the questions in a way that anyone can understand.

Aug 17th 2026

4 Weeks

Data Analysis Microsoft Excel Business Analysis

Coursera

University of Pennsylvania

A Crash Course in Causality: Inferring Causal Effects from Observational Data (Coursera)

Statistics & Data Analysis Data Science

We have all heard the phrase “correlation does not equal causation.” What, then, does equal causation? This course aims to answer that question and more! Over a period of 5 weeks, you will learn how causal effects are defined, what assumptions about your data and models are necessary, and how to implement and interpret some popular statistical methods. Learners will have the opportunity to apply these methods to example data in R (free statistical software environment).

Aug 17th 2026

5-12 Weeks

Statistics Probability Statistical Methods

Coursera

Erasmus University Rotterdam

Earth Economics (Coursera)

Economics & Finance

After this course you will be an Earth Economist that can provide evidence-based advise on the best global policy. As an Earth Economist you will better understand the behavior and advice of economists, have become a better economist yourself and know where to find Earth's data and how to analyze these world observations. Our planet is too important: we need you to get engaged!

Aug 17th 2026

5-12 Weeks

Economics Data Analysis Policy Analysis

Coursera

École Polytechnique Fédérale de Lausanne

Big Data Analysis with Scala and Spark (Coursera)

CS: Theory CS: Programming

Manipulating big data distributed over a cluster using functional concepts is rampant in industry, and is arguably one of the first widespread industrial uses of functional ideas. This is evidenced by the popularity of MapReduce and Hadoop, and most recently Apache Spark, a fast, in-memory distributed collections framework written in Scala. In this course, we'll see how the data parallel paradigm can be extended to the distributed case, using Spark throughout.

Aug 17th 2026

4 Weeks

Programming Algorithms SQL

Udacity

Udacity,Zipfian Academy

Data Visualization and D3.js (Udacity)

Statistics & Data Analysis Data Science

Communicating with Data. Learn the fundamentals of data visualization and practice communicating with data. This course covers how to apply design principles, human perception, color theory, and effective storytelling to data visualization. If you present data to others, aspire to be an analyst or data scientist, or if you’d like to become more technical with visualization tools, then you can grow your skills with this course.

Self Paced

Self-Paced

Storytelling Data Analysis Color Theory

Coursera

Johns Hopkins University

Python for Genomic Data Science (Coursera)

Statistics & Data Analysis Data Science

This class provides an introduction to the Python programming language and the iPython notebook. This is the third course in the Genomic Big Data Science Specialization from Johns Hopkins University.

Aug 17th 2026

4 Weeks

Programming Python Big Data