Software Architecture Patterns for Big Data (Coursera)

Software Architecture Patterns for Big Data (Coursera)

The course is intended for individuals looking to understand the architecture patterns necessary to take large software systems that make use of big data to production. You will transform big data prototypes into high quality tested production software. After measuring the performance characteristics of distributed systems, you will identify trouble areas and implement scalable solutions to improve performance. Upon completion of the course you will know how to scale production data stores to perform under load, designing load tests to ensure applications meet performance requirements.

Class Deals by MOOC List - Click here and see Coursera's Active Discounts, Deals, and Promo Codes.

Software Architecture Patterns for Big Data can be taken for academic credit as part of CU Boulder’s Master of Science in Data Science (MS-DS) degree offered on the Coursera platform. The MS-DS is an interdisciplinary degree that brings together faculty from CU Boulder’s departments of Applied Mathematics, Computer Science, Information Science, and others. With performance-based admissions and no application process, the MS-DS is ideal for individuals with a broad range of undergraduate education and/or professional experience in computer science, information science, mathematics, and statistics.
Course 2 of 3 in the Software Architecture for Big Data Specialization.

What You Will Learn

  • Compare, measure, and test big data models for production use.
  • Write custom performance tests to measure the characteristics of a distributed system.
  • Use queues to horizontally distribute large workloads.

Syllabus

WEEK 1
Predictive Models
In this module, you will learn how to write tests that allow you to iterate on predictive models.

WEEK 2
Performance of Distributed Systems
In this module, you will learn how to write performance tests to ensure your distributed system operates as expected in production.

WEEK 3
Horizontal Distribution of Large Workloads
In this module, you will learn how to use queues to horizontally distribute large workloads.

WEEK 4
Highly Available Distributed Systems
In this module, you will learn the advantages and disadvantages of high availability distributed systems.

Go to Class
MOOC List is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Related Courses

Big Data Analysis with Scala and Spark (Coursera) Coursera
École Polytechnique Fédérale de Lausanne

Big Data Analysis with Scala and Spark (Coursera)

Manipulating big data distributed over a cluster using functional concepts is rampant in industry, and is arguably one of the first widespread industrial uses of functional ideas. This is evidenced by the popularity of MapReduce and Hadoop, and most recently Apache Spark, a fast, in-memory distributed collections framework written in Scala. In this course, we'll see how the data parallel paradigm can be extended to the distributed case, using Spark throughout.

Jun 22nd 2026
4 Weeks
Prepare for DP-100: Data Science on Microsoft Azure Exam (Coursera) Coursera
Microsoft

Prepare for DP-100: Data Science on Microsoft Azure Exam (Coursera)

Microsoft certifications give you a professional advantage by providing globally recognized and industry-endorsed evidence of mastering skills in digital and cloud businesses. In this course, you will prepare to take the DP-100 Azure Data Scientist Associate certification exam. You will refresh your knowledge of how to plan and create a suitable working environment for data science workloads on Azure, run data experiments, and train predictive models. In addition, you will recap on how to manage, optimize, and deploy machine learning models into production.

Jun 22nd 2026
5-12 Weeks
Laboratório de Programação Orientada a Objetos - Parte 2 (Coursera) Coursera
Universidade de São Paulo, Brasil

Laboratório de Programação Orientada a Objetos - Parte 2 (Coursera)

Esta é a segunda parte do curso avançado de Programação Orientada a Objetos oferecido pelo Departamento de Ciência da Computação do IME-USP. O curso é voltado para profissionais ou estudantes interessados em Desenvolvimento de Software Avançado e que já dominam uma linguagem de programação e já conhecem os fundamentos básicos da orientação a objetos.

Jun 22nd 2026
5-12 Weeks
Managing Big Data with MySQL (Coursera) Coursera
Duke University

Managing Big Data with MySQL (Coursera)

This course is an introduction to how to use relational databases in business analysis. You will learn how relational databases work, and how to use entity-relationship diagrams to display the structure of the data held within them. This knowledge will help you understand how data needs to be collected in business contexts, and help you identify features you want to consider if you are involved in implementing new data collection efforts.

Jun 22nd 2026
5-12 Weeks
Cloud Computing Applications, Part 2: Big Data and Applications in the Cloud (Coursera) Coursera
University of Illinois at Urbana-Champaign

Cloud Computing Applications, Part 2: Big Data and Applications in the Cloud (Coursera)

Welcome to the Cloud Computing Applications course, the second part of a two-course series designed to give you a comprehensive view on the world of Cloud Computing and Big Data! In this second course we continue Cloud Computing Applications by exploring how the Cloud opens up data analytics of huge volumes of data that are static or streamed at high velocity and represent an enormous variety of information. Cloud applications and data analytics represent a disruptive change in the ways that society is informed by, and uses information.

Jun 22nd 2026
4 Weeks
The Importance of Listening (Coursera) Coursera
Northwestern University

The Importance of Listening (Coursera)

In this second MOOC in the Social Marketing Specialization - "The Importance of Listening" - you will go deep into the Big Data of social and gain a more complete picture of what can be learned from interactions on social sites. You will be amazed at just how much information can be extracted from a single post, picture, or video.

Jun 22nd 2026
4 Weeks
Big Data Analysis with Scala and Spark (Scala 2 version) (Coursera) Coursera
École Polytechnique Fédérale de Lausanne

Big Data Analysis with Scala and Spark (Scala 2 version) (Coursera)

Manipulating big data distributed over a cluster using functional concepts is rampant in industry, and is arguably one of the first widespread industrial uses of functional ideas. This is evidenced by the popularity of MapReduce and Hadoop, and most recently Apache Spark, a fast, in-memory distributed collections framework written in Scala. In this course, we'll see how the data parallel paradigm can be extended to the distributed case, using Spark throughout.

Jun 22nd 2026
4 Weeks
Big Data, Genes, and Medicine (Coursera) Coursera
The State University of New York

Big Data, Genes, and Medicine (Coursera)

This course distills for you expert knowledge and skills mastered by professionals in Health Big Data Science and Bioinformatics. You will learn exciting facts about the human body biology and chemistry, genetics, and medicine that will be intertwined with the science of Big Data and skills to harness the avalanche of data openly available at your fingertips and which we are just starting to make sense of.

Jun 22nd 2026
5-12 Weeks
Introdução à Ciência e Engenharia de Dados (Coursera) Coursera
FIA Business School

Introdução à Ciência e Engenharia de Dados (Coursera)

Neste curso, você aprenderá que os dados se tornaram o principal ativo de negócios nos dias de hoje. Com o aumento do Big Data e criação de novas tecnologias, as organizações em todo o mundo estão inovando e descobrindo novas formas para analisar o potencial dos dados à sua disposição, o que ajuda no crescimento, na lucratividade, no direcionamento das operações gerais e no aumento da satisfação do cliente. Mas para que tudo isso funcione corretamente e seja possível extrair todo o potencial de forma precisa e que seja viável para o negócio, criou-se a área de ciência de dados.

Jun 22nd 2026
4 Weeks
Foundations for Big Data Analysis with SQL (Coursera) Coursera
Cloudera

Foundations for Big Data Analysis with SQL (Coursera)

In this course, you'll get a big-picture view of using SQL for big data, starting with an overview of data, database systems, and the common querying language (SQL). Then you'll learn the characteristics of big data and SQL tools for working on big data platforms. You'll also install an exercise environment (virtual machine) to be used through the specialization courses, and you'll have an opportunity to do some initial exploration of databases and tables in that environment.

Jun 22nd 2026
5-12 Weeks
Applications of Software Architecture for Big Data (Coursera) Coursera
University of Colorado Boulder

Applications of Software Architecture for Big Data (Coursera)

The course is intended for individuals who want to build a production-quality software system that leverages big data. You will apply the basics of software engineering and architecture to create a production-ready distributed system that handles big data. You will build data intensive, distributed system, composed of loosely coupled, highly cohesive applications.

Jun 22nd 2026
4 Weeks