Analyzing Data with R (edX)

Offered by IBM

This course walks you through the process of answering questions through data. The R programming language is purpose-built for data analysis. R is the key that opens the door between the problems you want to solve with data and the answers you need to meet your objectives.

This course starts with a question, and then walks you through the process of answering it through data. You will first learn important techniques for preparing (or wrangling) your data for analysis. Then you will learn how to gain a better understanding of your data through exploratory data analysis, helping you to summarize your data and identify relevant relationships between variables that can lead to insights.
Once your data is ready to analyze, you will learn how to develop your model, evaluate it and tune its performance. By following this process, you can be sure that your data analysis performs to the standards that you have set, so that you can have confidence in the results.
By playing the role of a data analyst who is analyzing airline departure and arrival data to predict flight delays, you will build hands-on experience delivering insights using data. Using an Airline Reporting Carrier On-Time Performance Dataset, you will practice reading data files, preprocessing data, creating models, improving models, and evaluating them to ultimately choose the best one to use.
Note: The prerequisite for this course is basic R programming skills. For example, ensure that you have completed a course like Introduction to R Programming for Data Science from IBM.

This course is part of the Data Analytics and Visualization with Excel and R Professional Certificate and the Applied Data Science with R Professional Certificate.

What you'll learn

  • Prepare data for analysis by handling missing values, formatting and normalizing data, binning, and turning categorical values into numeric values.
  • Conduct exploratory data analysis using descriptive statistics, data grouping, analysis of variance (ANOVA), and correlation statistics.
  • Develop a predictive model using various regression methods.
  • Evaluate a model for overfitting and underfitting conditions and tune its performance using regularization and grid search.

Related Courses

Biostatistics for Big Data Applications (edX)
University of Texas Medical Branch

Learn data analysis basics for working with biomedical big data with practical hands-on examples using R. This course provides a broad foundation of statistical terms and concepts as well as an introduction to the R statistical software package. The topics covered are fundamental components of biostatistical methods used in both omics and population health research.

No sessions available
5-12 Weeks

Introduction to Applied Biostatistics: Statistics for Medical Research (edX)
Osaka University

Learn data analysis for medical research with practical hands-on examples using R Commander. Want to learn how to analyze real-world medical data, but unsure where to begin? This Applied Biostatistics course provides an introduction to important topics in medical statistical concepts and reasoning.

No sessions available
5-12 Weeks

Analytics for Decision Making (edX)
Babson College

Discover the foundational concepts that support modern data science and learn to analyze various data types and quality to make smart business decisions. Want to know how to avoid bad decisions with data? Making good decisions with data can give you a distinct competitive advantage in business. This statistics and data analysis course will help you understand the fundamental concepts of sound statistical thinking that can be applied in surprisingly wide contexts, sometimes even before there is any data! Key concepts like understanding variation, perceiving relative risk of alternative decisions, and pinpointing sources of variation will be highlighted.

Self-Paced

Data, Analytics and Learning (edX)
University of Texas at Arlington, UTArlingtonX

An introduction to the logic and methods of analysis of data to improve teaching and learning. Capturing and analyzing data has changed how decisions are made and resources are allocated in businesses, journalism, government, and military and intelligence fields. Through better use of data, leaders are able to plan and enact strategies with greater clarity and confidence.

No sessions available
4 Weeks

Case Studies in Functional Genomics (edX)
HarvardX, Harvard University

Perform RNA-Seq, ChIP-Seq, and DNA methylation data analyses, using open source software, including R and Bioconductor. We will explain how to perform the standard processing and normalization steps, starting with raw data, to get to the point where one can investigate relevant biological questions.

Self-Paced

Probability: Basic Concepts & Discrete Random Variables (edX)
Purdue University, PurdueX

Learn fundamental concepts of mathematical probability to prepare for a career in the growing field of information and data science. Our capacity to collect and store data has increased exponentially, but deriving information from data from a scientific perspective requires a foundational knowledge of probability. Are you interested in a career in the emerging data science field, or as an actuarial scientist? Or do you want to better understand statistical theory and mathematical modeling?

No sessions available
5-12 Weeks

Introduction to Linear Models and Matrix Algebra (edX)
HarvardX, Harvard University

Learn to use R programming to apply linear models to analyze data in life sciences. Matrix algebra underlies many of the current tools for experimental design and the analysis of high-dimensional data. In this introductory data analysis course, we will use matrix algebra to represent the linear models that are commonly used to model differences between experimental units. We perform statistical inference on these differences. Throughout the course we will use the R programming language.

Self-Paced

Enabling Technologies for Data Science and Analytics: The Internet of Things (edX)
Columbia University, ColumbiaX

Discover the relationship between Big Data and the Internet of Things (IoT). The Internet of Things is rapidly growing. It is predicted that more than 25 billion devices will be connected by 2020. In this data science course, you will learn about the major components of the Internet of Things and how data is acquired from sensors. You will also examine ways of analyzing event data, sentiment analysis, facial recognition software and how data generated from devices can be used to make decisions.

Self-Paced

Observation Theory: Estimating the Unknown (edX)
Delft University of Technology, DelftX

Learn how to estimate parameters from observational data for real-world engineering applications and assess the quality of the results. Are you an engineer, scientist or technician? Are you dealing with measurements or big data, but unsure how to proceed? This course teaches you how to find the best estimates of unknown parameters from noisy observations. You will also learn how to assess the quality of your results.

Self-Paced

Quantitative Biology Workshop (edX)
MIT, MITx

A workshop-style introduction to tools used in biological research. Discover how to analyze data using computational methods. Do you have an interest in biology and quantitative tools? Do you know computational methods but do not realize how they apply to biological problems? Do you know biology but do not understand how scientists really analyze complicated data? 7.QBWx: Quantitative Biology Workshop is designed to give learners exposure to the application of quantitative tools to analyze biological data at an introductory level.

Self-Paced