EdX

R Data Science Capstone Project (edX)

Offered by IBM,
R Data Science Capstone Project (edX)

Apply various data analysis and visualization skills and techniques you have learned by taking on the role of a data scientist working with real-world data sets. In this capstone course, you will apply various data science skills and techniques that you have learned as part of the previous courses in the IBM Data Science with R or IBM Data Analytics with Excel and R Professional Certificate Programs.

Class Deals by MOOC List - Click here and see EdX's Active Discounts, Deals, and Promo Codes.

In this capstone project, you will take on the role of a data scientist who has recently joined an organization and is presented with a challenge that requires data collection, analysis, basic hypothesis testing, visualization, and modeling to be performed on real-world datasets. You will collect and understand data from multiple sources, conduct data wrangling and preparation with Tidyverse, perform exploratory data analysis with SQL, Tidyverse and ggplot2, model data with linear regression, create charts and plots to visualize the data, and build an interactive dashboard.
The project will culminate with a presentation of your data analysis report, with an executive summary for the various stakeholders in the organization.
This course is part of the Data Analytics and Visualization with Excel and R Professional Certificate and Applied Data Science with R Professional Certificate

What you'll learn

  • Prepare data for modelling by handling missing values, formatting and normalizing data, binning, and turning categorical values into numeric values.
  • Do exploratory data analysis using descriptive statistics, data grouping, data analysis and correlation statistics.
Go to Class
MOOC List is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Related Courses

Statistical Predictive Modelling and Applications (edX) EdX
University of Edinburgh,EdinburghX

Statistical Predictive Modelling and Applications (edX)

Learn how to apply statistical modelling techniques to real-world business scenarios using Python. In this course, you will learn three predictive modelling techniques - linear and logistic regression, and naive Bayes - and their applications in real-world scenarios. The first half of the course focuses on linear regression. This technique allows you to model a continuous outcome variable using both continuous and categorical predictors. This technique enables you to predict product sales based on several customer variables.

Jan 18th 2022
5-12 Weeks
Predictive Analytics (edX) EdX
Indian Institute of Management, Bangalore,IIMBx

Predictive Analytics (edX)

Master the tools of predictive analytics in this statistics based analytics course. Decision makers often struggle with questions such as: What should be the right price for a product? Which customer is likely to default in his/her loan repayment? Which products should be recommended to an existing customer? Finding right answers to these questions can be challenging yet rewarding.

This course is archived
5-12 Weeks
Foundations of Data Analysis - Part 1: Statistics Using R (edX) EdX
University of Texas at Austin,UTAustinX

Foundations of Data Analysis - Part 1: Statistics Using R (edX)

Use R to learn fundamental statistical topics such as descriptive statistics and modeling. In this first part of a two part course, we’ll walk through the basics of statistical thinking – starting with an interesting question. Then, we’ll learn the correct statistical tool to help answer our question of interest – using R and hands-on Labs. Finally, we’ll learn how to interpret our findings and develop a meaningful conclusion.

No sessions available
5-12 Weeks
Knowledge Management and Big Data in Business (edX) EdX
The Hong Kong Polytechnic University,HKPolyUx

Knowledge Management and Big Data in Business (edX)

Learn why and how knowledge management and Big Data are vital to the new business era. The business landscape is changing so rapidly that traditional management, business and computing courses do not meet the needs for the next generation of workers in the business world. Most traditional methods are of a repetitive, rule-based nature and will be gradually replaced by Artificial Intelligence.

Self Paced
Self-Paced
Statistics for Business - II (edX) EdX
Indian Institute of Management, Bangalore,IIMBx

Statistics for Business - II (edX)

Examine data drawn from allied fields of business such as Finance and HR, and learn how to simulate data to follow a specified distribution. Statistics is a versatile discipline that has revolutionized the fields of business, engineering, medicine and pure sciences. This course is Part 2 of a 4-part series on Business Statistics, and is ideal for learners who wish to enroll in business programs. The first two parts cover topics in Descriptive Statistics, whereas the next two focus on Inferential Statistics.

No sessions available
5-12 Weeks
Datos para la efectividad de las políticas públicas (edX) EdX
Inter-American Development Bank - IDB,IDBx

Datos para la efectividad de las políticas públicas (edX)

Este curso te ayudará a tomar el control de los datos y familiarizarte con las herramientas para utilizarlos en la planificación, gestión y evaluación de políticas publicas. En esta era de la información, los datos están disponibles en todos lados y crecen a una tasa exponencial. ¿Cómo podemos darles sentido a todos los datos y aprovecharlos en el momento de tomar decisiones?, ¿cómo los utilizamos para que nos ayuden a guiar la gestión y planificación de nuestras políticas? Tanto si eres ciudadano como planificador de políticas, deberías poder responder a estas preguntas.

Self Paced
Self-Paced
Statistics Using Python (edX) EdX
University of Wisconsin–Madison,WisconsinX

Statistics Using Python (edX)

Learn the fundamentals of statistics using Python. This course is a compact primer in statistics as a foundation for data-driven business analysis. A selection of concepts include descriptive statistics, probability, inference, correlation, and regression. The course also exposes students to basic Python programming for use in statistics.

Jan 23rd 2024
5-12 Weeks
Analyzing and Visualizing Data with Power BI (edX) EdX
Davidson College,DavidsonX

Analyzing and Visualizing Data with Power BI (edX)

Step up your analytics game and learn one of the most in-demand job skills in the United States. Power BI is a robust business analytics and visualization tool from Microsoft that helps data professionals bring their data to life and tell more meaningful stores. This four-week course is a beginner's guide to working with data in Power BI and is perfect for professionals. You'll become confident in working with data, creating data visualizations, and preparing reports and dashboards.

Self Paced
Self-Paced
The Analytics Edge (edX) EdX
MIT,MITx

The Analytics Edge (edX)

Through inspiring examples and stories, discover the power of data and use analytics to provide an edge to your career and your life. In the last decade, the amount of data available to organizations has reached unprecedented levels. Data is transforming business, social interactions, and the future of our society. In this course, you will learn how to use data and analytics to give an edge to your career and your life.

This course is archived
13-24 Weeks