The Path to Insights: Data Models and Pipelines (Coursera)

Offered by Google,
The Path to Insights: Data Models and Pipelines (Coursera)

This is the second of three courses in the Google Business Intelligence Certificate. In this course, you'll explore data modeling and how databases are designed. Then you’ll learn about extract, transform, load (ETL) processes that extract data from source systems, transform it into formats that enable analysis, and drive business processes and goals.

Class Deals by MOOC List - Click here and see Coursera's Active Discounts, Deals, and Promo Codes.

Google employees who currently work in BI will guide you through this course by providing hands-on activities that simulate job tasks, sharing examples from their day-to-day work, and helping you build business intelligence skills to prepare for a career in the field.
Learners who complete the three courses in this certificate program will have the skills needed to apply for business intelligence jobs. This certificate program assumes prior knowledge of foundational analytical principles, skills, and tools covered in the Google Data Analytics Certificate.
By the end of this course, you will:
-Determine which data models are appropriate for different business requirements
-Describe the difference between creating and interacting with a data model
-Create data models to address different types of questions
-Explain the parts of the extract, transform, load (ETL) process and tools used in ETL
-Understand extraction processes and tools for different data storage systems
-Design an ETL process that meets organizational and stakeholder needs
-Design data pipelines to automate BI processes

Syllabus

WEEK 1
Data models and pipelines
You’ll start this course by exploring data modeling, common schemas, and database elements. You’ll consider how business needs determine the kinds of database systems that BI professionals implement. Then, you’ll discover pipelines and ETL processes, which are tools that move data and ensure that it’s accessible and useful.

WEEK 2
Dynamic database design
You’ll learn more about database systems, including data marts, data lakes, data warehouses, and ETL processes. You’ll also investigate the five factors of database performance: workload, throughput, resources, optimization, and contention. Finally, you’ll consider how to design efficient queries that get the most from a system.

WEEK 3
Optimize ETL processes
You’ll learn about optimization techniques including ETL quality testing, data schema validation, business rule verification, and general performance testing. You’ll also explore data integrity and learn how built-in quality checks defend against potential problems. Finally, you’ll focus on verifying business rules and general performance testing to make sure pipelines meet the intended business need.

WEEK 4
Course 2 end-of-course project
You’ll complete an end-of-course project by creating a pipeline process to deliver data to a target table and developing reports based on project needs. You’ll also ensure that the pipeline is performing correctly and that there are built-in defenses against data quality issues.

Go to Class
MOOC List is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Related Courses

Data Warehouse Concepts, Design, and Data Integration (Coursera) Coursera
University of Colorado System

Data Warehouse Concepts, Design, and Data Integration (Coursera)

This is the second course in the Data Warehousing for Business Intelligence specialization. Ideally, the courses should be taken in sequence. In this course, you will learn exciting concepts and skills for designing data warehouses and creating data integration workflows. These are fundamental skills for data warehouse developers and administrators. You will have hands-on experience for data warehouse design and use open source products for manipulating pivot tables and creating data integration workflows.

Jun 22nd 2026
5-12 Weeks
Evaluation of Digital Health Interventions (Coursera) Coursera
Imperial College London

Evaluation of Digital Health Interventions (Coursera)

This course focuses on data, evaluation methods and the economic evaluation of digital health interventions. This module focuses on key data considerations for digital health including data management, data visualisation and methods for evaluating digital health interventions. The key focus is on experimental and quasi-experimental design approaches that can be applied to evaluating digital health interventions and key considerations for the economic evaluation of digital health interventions.

Jun 22nd 2026
4 Weeks
Databases and SQL for Data Science with Python(Coursera) Coursera
IBM

Databases and SQL for Data Science with Python(Coursera)

Much of the world's data resides in databases. SQL (or Structured Query Language) is a powerful language which is used for communicating with and extracting data from databases. A working knowledge of databases and SQL is a must if you want to become a data scientist. The purpose of this course is to introduce relational database concepts and help you learn and apply foundational knowledge of the SQL language. It is also intended to get you started with performing SQL access in a data science environment.

Jun 22nd 2026
4 Weeks
Building Database Applications in PHP (Coursera) Coursera
University of Michigan

Building Database Applications in PHP (Coursera)

In this course, we'll look at the object oriented patterns available in PHP. You'll learn how to connect to a MySQL using the Portable Data Objects (PDO) library and issue SQL commands in the the PHP language. We'll also look at how PHP uses cookies and manages session data. You'll learn how PHP avoids double posting data, how flash messages are implemented, and how to use a session to log in users in web applications.

Jun 22nd 2026
5-12 Weeks
IBM Data Privacy for Information Architecture (Coursera) Coursera
IBM

IBM Data Privacy for Information Architecture (Coursera)

Data privacy controls how information is collected, used, shared, and disposed of, in accordance with policies or external laws and regulations. In this course, students will gain an understanding of what data privacy is along with how to identify and understand typical data protection and privatization objectives that an enterprise may have, and how to choose a data protection approach.

Jun 22nd 2026
5-12 Weeks
Salesforce Basics (Coursera) Coursera
University of California, Irvine

Salesforce Basics (Coursera)

In this course, you will learn about what the world’s number one Customer Relationship Manager (CRM) system has to offer. You will begin this course by understanding the components that Salesforce leverages to make it an optimal system. You will learn about the basics in Lightning for Sales, Community Cloud and Marketing, and understanding how to secure your Salesforce Organization and Manage Permissions. These tools will serve as building blocks to implementing Salesforce into any organization. The course includes in-depth readings and practical application activities within Salesforce's Trailhead education platform, peer discussion opportunities, demonstration videos, and peer review assignments.

Jun 22nd 2026
3 Weeks
Managing Big Data in Clusters and Cloud Storage (Coursera) Coursera
Cloudera

Managing Big Data in Clusters and Cloud Storage (Coursera)

In this course, you'll learn how to manage big datasets, how to load them into clusters and cloud storage, and how to apply structure to the data so that you can run queries on it using distributed SQL engines like Apache Hive and Apache Impala. You’ll learn how to choose the right data types, storage systems, and file formats based on which tools you’ll use and what performance you need.

Jun 22nd 2026
5-12 Weeks
Foundations for Big Data Analysis with SQL (Coursera) Coursera
Cloudera

Foundations for Big Data Analysis with SQL (Coursera)

In this course, you'll get a big-picture view of using SQL for big data, starting with an overview of data, database systems, and the common querying language (SQL). Then you'll learn the characteristics of big data and SQL tools for working on big data platforms. You'll also install an exercise environment (virtual machine) to be used through the specialization courses, and you'll have an opportunity to do some initial exploration of databases and tables in that environment.

Jun 22nd 2026
5-12 Weeks
Health Information Technology Fundamentals (Coursera) Coursera
Johns Hopkins University

Health Information Technology Fundamentals (Coursera)

In this course you will receive an overview of the health IT ecosystem and the types of technologies IT support staff interact with the most. You will be introduced to the role of electronic health records (EHRs), clinical decision support, telemedicine, patient portals, and medical devices. We’ll cover common examples of how disruptions in these technologies can impact ongoing operations and routine clinical workflow. Although there is an interconnectedness to some aspects of healthcare, there are also limitations in data sharing. We want you to walk away from this course with an understanding of the family of technologies and tools that are critical in healthcare operations.

Jun 22nd 2026
4 Weeks
Relational database systems (Coursera) Coursera
Universidad Nacional Autónoma de México

Relational database systems (Coursera)

Welcome to the specialization course Relational Database Systems. This course will be completed on six weeks, it will be supported with videos and various documents that will allow you to learn in a very simple way how several types of information systems and databases are available to solve different problems and needs of the companies.

Jun 22nd 2026
5-12 Weeks