EdX

Modernizing Data Lakes and Data Warehouses with Google Cloud (edX)

Offered by Google Cloud,
Modernizing Data Lakes and Data Warehouses with Google Cloud (edX)

This course is intended for developers who are responsible for: Querying datasets, visualizing query results, and creating reports. Specific job roles include: Data Engineer, Data Analyst, Database Administrators, Big Data Architects.

Class Deals by MOOC List - Click here and see EdX's Active Discounts, Deals, and Promo Codes.

The two key components of any data pipeline are data lakes and warehouses. This course highlights use-cases for each type of storage and dives into the available data lake and warehouse solutions on Google Cloud in technical detail. Also, this course describes the role of a data engineer, the benefits of a successful data pipeline to business operations, and examines why data engineering should be done in a cloud environment.
This is the first course of the Data Engineering on Google Cloud series. After completing this course, enroll in the Building Batch Data Pipelines on Google Cloud course.
This course is part of the Google Cloud Data Engineer Learning Path Professional Certificate.

What you'll learn

  • Differentiate between data lakes and data warehouses.
  • Explore use-cases for each type of storage and the available data lake and warehouse solutions on Google Cloud.
  • Discuss the role of a data engineer and the benefits of a successful data pipeline to business operations.
  • Examine why data engineering should be done in a cloud environment.

Prerequisites:
To benefit from this course, participants should have completed “Google Cloud Big Data and Machine Learning Fundamentals” or have equivalent experience. Participant should also have: • Basic proficiency with a common query language such as SQL. • Experience with data modeling and ETL (extract, transform, load) activities. • Experience with developing applications using a common programming language such as Python. • Familiarity with machine learning and/or statistics

Syllabus

  1. Introduction

This module introduces the Data Engineering on Google Cloud source series and this Modernizing Data Lakes and Data Warehouses with Google Cloud course.

  1. Introduction to Data Engineering

This module discusses the role of data engineering and motivates the claim why data engineering should be done in the Cloud.

  1. Building a Data Lake

In this module, we describe what data lake is and how to use Cloud Storage as your data lake on Google Cloud.

  1. Building a Data Warehouse

In this module, we talk about BigQuery as a data warehousing option on Google Cloud.

  1. Summary

A summary of the key learning points.

  1. Course Resources

Links to PDF versions of each module.

Go to Class
MOOC List is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Related Courses

Preparing for the Google Cloud Professional Data Engineer Exam (Coursera) Coursera
Google Cloud

Preparing for the Google Cloud Professional Data Engineer Exam (Coursera)

From the course: "The best way to prepare for the exam is to be competent in the skills required of the job." This course uses a top-down approach to recognize knowledge and skills already known, and to surface information and skill areas for additional preparation. You can use this course to help create your own custom preparation plan. It helps you distinguish what you know from what you don't know. And it helps you develop and practice skills required of practitioners who perform this job.

Jun 27th 2026
5-12 Weeks
Building Scalable Java Microservices with Spring Boot and Spring Cloud (Coursera) Coursera
Google Cloud

Building Scalable Java Microservices with Spring Boot and Spring Cloud (Coursera)

"Microservices" describes a software design pattern in which an application is a collection of loosely coupled services. These services are fine-grained, and can be individually maintained and scaled. The microservices architecture is ideal for the public cloud, with its focus on elastic scaling with on-demand resources. In this course, you will learn how to build Java applications using Spring Boot and Spring Cloud on Google Cloud Platform.

Jun 23rd 2026
2 Weeks
Getting Started with Google Kubernetes Engine (edX) EdX
Google Cloud

Getting Started with Google Kubernetes Engine (edX)

This class is intended for the following participants: - Application developers, Cloud Solutions Architects, DevOps Engineers, IT managers. - Individuals using Google Cloud to create new solutions or to integrate existing systems, application environments, and infrastructure with Google Cloud. - Executives and business decision makers evaluating the potential of GKE to address business needs.

Self Paced
Self-Paced
AI Skills for Engineers: Data Engineering and Data Pipelines (edX) EdX
Delft University of Technology,DelftX

AI Skills for Engineers: Data Engineering and Data Pipelines (edX)

Good data is central to effective AI applications. This course teaches the basics of data for AI, covering what data is needed, how to extract data from existing databases and basic data skills including setup of a Python notebook environment, basic data exploration and simple data visualizations.

Self Paced
Self-Paced
Google Cloud Big Data and Machine Learning Fundamentals (edX) EdX
Google Cloud

Google Cloud Big Data and Machine Learning Fundamentals (edX)

Data Analysts, Data Engineers, Data Scientists, and ML Engineers who are getting started with Google Cloud. This course introduces the Google Cloud big data and machine learning products and services that support the data-to-AI lifecycle. It explores the processes, challenges, and benefits of building a big data pipeline and machine learning models with Vertex AI on Google Cloud.

Self Paced
Self-Paced