EdX

Big Data Technology Capstone Project (edX)

Big Data Technology Capstone Project (edX)

The Big Data Technology Capstone Project will allow you to apply the techniques and theory you have gained from the four courses in this MicroMasters program to a medium-scale project. In this capstone course, you will get an opportunity to apply the knowledge and skills that you have gained throughout this MicroMasters program.

Class Deals by MOOC List - Click here and see EdX's Active Discounts, Deals, and Promo Codes.

You can choose to complete any one project from a number of choices, covering topics ranging from data integration, data mining, Spark programming, to data analysis. After finishing the project, you will need to submit a report together with the code, to be reviewed by our TAs.

By completing this capstone project, you will create a showcase project and demonstrate to employers that you are job ready and a worthy candidate in the field of big data.
This course is part of the Big Data Technology MicroMasters.

What you'll learn

  • Apply your knowledge on big data technologies to a real-life scenario
  • Build a showcase project to demonstrate your knowledge and experience
  • How to independently work on a big data project

Prerequisites:
Candidates interested in pursuing this program are advised to complete the following courses before this course:

  • Foundations of Data Analytics
  • Data Mining and Knowledge Discovery
  • Big Data Computing with Spark
  • Mathematical Methods for Data Analysis
Go to Class
MOOC List is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Related Courses

Datos para la efectividad de las políticas públicas (edX) EdX
Inter-American Development Bank - IDB,IDBx

Datos para la efectividad de las políticas públicas (edX)

Este curso te ayudará a tomar el control de los datos y familiarizarte con las herramientas para utilizarlos en la planificación, gestión y evaluación de políticas publicas. En esta era de la información, los datos están disponibles en todos lados y crecen a una tasa exponencial. ¿Cómo podemos darles sentido a todos los datos y aprovecharlos en el momento de tomar decisiones?, ¿cómo los utilizamos para que nos ayuden a guiar la gestión y planificación de nuestras políticas? Tanto si eres ciudadano como planificador de políticas, deberías poder responder a estas preguntas.

Self Paced
Self-Paced
Big Data Analytics Using Spark (edX) EdX
University of California, San Diego,UC San DiegoX

Big Data Analytics Using Spark (edX)

Learn how to analyze large datasets using Jupyter notebooks, MapReduce and Spark as a platform. In data science, data is called “big” if it cannot fit into the memory of a single standard laptop or workstation. The analysis of big datasets requires using a cluster of tens, hundreds or thousands of computers. Effectively using such clusters requires the use of distributed files systems, such as the Hadoop Distributed File System (HDFS) and corresponding computational models, such as Hadoop, MapReduce and Spark.

Dec 5th 2023
5-12 Weeks
Probability: Basic Concepts & Discrete Random Variables (edX) EdX
Purdue University,PurdueX

Probability: Basic Concepts & Discrete Random Variables (edX)

Learn fundamental concepts of mathematical probability to prepare for a career in the growing field of information and data science. Our capacity to collect and store data has exponentially increased, but deriving information from data from a scientific perspective requires a foundational knowledge of probability. Are you interested in a career in the emerging data science field, or as an actuarial scientist? Or want better to understand statistical theory and mathematical modeling?

No sessions available
5-12 Weeks
Introduction to Apache Spark (edX) EdX
University of California, Berkeley

Introduction to Apache Spark (edX)

Learn the fundamentals and architecture of Apache Spark, the leading cluster-computing framework among professionals. Spark is rapidly becoming the compute engine of choice for big data. Spark programs are more concise and often run 10-100 times faster than Hadoop MapReduce jobs. As companies realize this, Spark developers are becoming increasingly valued.

Not Available
Course Not Available
Analyzing and Visualizing Data with Power BI (edX) EdX
Davidson College,DavidsonX

Analyzing and Visualizing Data with Power BI (edX)

Step up your analytics game and learn one of the most in-demand job skills in the United States. Power BI is a robust business analytics and visualization tool from Microsoft that helps data professionals bring their data to life and tell more meaningful stores. This four-week course is a beginner's guide to working with data in Power BI and is perfect for professionals. You'll become confident in working with data, creating data visualizations, and preparing reports and dashboards.

Self Paced
Self-Paced
Enabling Technologies for Data Science and Analytics: The Internet of Things (edX) EdX
Columbia University,ColumbiaX

Enabling Technologies for Data Science and Analytics: The Internet of Things (edX)

Discover the relationship between Big Data and the Internet of Things (IoT). The Internet of Things is rapidly growing. It is predicted that more than 25 billion devices will be connected by 2020. In this data science course, you will learn about the major components of the Internet of Things and how data is acquired from sensors. You will also examine ways of analyzing event data, sentiment analysis, facial recognition software and how data generated from devices can be used to make decisions.

Self Paced
Self-Paced
Foundations of Data Analysis - Part 1: Statistics Using R (edX) EdX
University of Texas at Austin,UTAustinX

Foundations of Data Analysis - Part 1: Statistics Using R (edX)

Use R to learn fundamental statistical topics such as descriptive statistics and modeling. In this first part of a two part course, we’ll walk through the basics of statistical thinking – starting with an interesting question. Then, we’ll learn the correct statistical tool to help answer our question of interest – using R and hands-on Labs. Finally, we’ll learn how to interpret our findings and develop a meaningful conclusion.

No sessions available
5-12 Weeks
The Analytics Edge (edX) EdX
MIT,MITx

The Analytics Edge (edX)

Through inspiring examples and stories, discover the power of data and use analytics to provide an edge to your career and your life. In the last decade, the amount of data available to organizations has reached unprecedented levels. Data is transforming business, social interactions, and the future of our society. In this course, you will learn how to use data and analytics to give an edge to your career and your life.

This course is archived
13-24 Weeks
Quantitative Biology Workshop (edX) EdX
MIT,MITx

Quantitative Biology Workshop (edX)

A workshop-style introduction to tools used in biological research. Discover how to analyze data using computational methods. Do you have an interest in biology and quantitative tools? Do you know computational methods but do not realize how they apply to biological problems? Do you know biology but do not understand how scientists really analyze complicated data? 7.QBWx: Quantitative Biology Workshop is designed to give learners exposure to the application of quantitative tools to analyze biological data at an introductory level.

Self Paced
Self-Paced
Platform-Based Analytics (edX) EdX
Indiana University,IUx

Platform-Based Analytics (edX)

Gain hands-on experience extracting, preparing, exploring, and analyzing data statistically and visually using features and tools native to Microsoft Excel. In an ever-growing digital world, the need for strong data analysis skills is at the forefront of every business function, along with the ability to accurately describe and interpret analytical findings.

Nov 7th 2023
5-12 Weeks
Applications of Linear Algebra Part 2 (edX) EdX
Davidson College,DavidsonX

Applications of Linear Algebra Part 2 (edX)

Explore applications of linear algebra in the field of data mining by learning fundamentals of search engines, clustering movies into genres and of computer graphics by posterizing an image. Our world is in a data deluge with ever increasing sizes of datasets. Linear algebra is a tool to manage and analyze such data. This course is part 2 of a 2-part course, with this part extending smoothly from the first. Note, however, that part 1, is not a prerequisite for part 2.

No sessions available
4 Weeks