Serverless Data Processing with Dataflow: Operations en Español (Coursera)

Offered by Google Cloud,
Serverless Data Processing with Dataflow: Operations en Español (Coursera)

En esta última parte de la serie de cursos de Dataflow, presentaremos los componentes del modelo operativo de Dataflow. Examinaremos las herramientas y técnicas que permiten solucionar problemas y optimizar el rendimiento de las canalizaciones. Luego, revisaremos las prácticas recomendadas de las pruebas, la implementación y la confiabilidad en relación con las canalizaciones de Dataflow.

Class Deals by MOOC List - Click here and see Coursera's Active Discounts, Deals, and Promo Codes.

Concluiremos con una revisión de las plantillas, que facilitan el ajuste de escala de las canalizaciones de Dataflow para organizaciones con cientos de usuarios. Estas clases asegurarán que su plataforma de datos sea estable y resiliente ante circunstancias inesperadas.

What You Will Learn

  • Realizar actividades de supervisión, solución de problemas, pruebas y CI/CD en las canalizaciones de Dataflow.
  • Implementar canalizaciones de Dataflow pensando en la confiabilidad para maximizar la estabilidad de su plataforma de procesamiento de datos.

Syllabus

WEEK 1
Introducción
En este módulo, se aborda la descripción del curso.
Monitoring
En este módulo, aprenderemos a usar la página Lista de trabajos para filtrar los trabajos que deseamos supervisar o investigar. Observaremos cómo las pestañas Gráfico del trabajo, Información del trabajo y Métricas del trabajo brindan en conjunto un resumen completo de su trabajo de Dataflow. Por último, aprenderemos a usar la integración de Dataflow en el Explorador de métricas a fin de crear políticas de alertas para métricas de Dataflow.
Informes de errores y registros
En este módulo, aprenderemos a usar el panel Registro ubicado al final de las páginas Grafo de trabajo y Métricas del trabajo, y sobre la página Informes de errores centralizada.
Solución de problemas y depuración
En este módulo, aprenderemos a depurar canalizaciones de Dataflow y solucionar sus problemas. También revisaremos los cuatro modos habituales de fallas de Dataflow: falla en la compilación de canalizaciones, falla en el inicio de la canalización de Dataflow, falla en la ejecución de canalizaciones y problemas de rendimiento.

WEEK 2
Rendimiento
En este módulo, analizaremos las consideraciones de rendimiento que se deben tener presentes cuando se desarrollan canalizaciones por lotes y de transmisión en Dataflow.
Testing y CI/CD
En este módulo, analizaremos cómo realizar pruebas de unidades de las canalizaciones de Dataflow. También presentaremos los frameworks y las funciones disponibles a fin de optimizar su flujo de trabajo de CI/CD para las canalizaciones de Dataflow.
Confiabilidad
En este módulo, analizaremos métodos para compilar sistemas que sean resilientes ante la corrupción de datos y las interrupciones de los centros de datos.
Plantillas Flexibles
En este módulo, se abordan las plantillas flexibles, una función que ayuda a los equipos de ingeniería a estandarizar y reutilizar el código de las canalizaciones de Dataflow. Muchos desafíos operativos se pueden solucionar con las plantillas flexibles.
Resumen
En este módulo, se revisan los temas abordados en el curso.

Go to Class
MOOC List is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Related Courses

Parallel Programming in Java (Coursera) Coursera
Rice University

Parallel Programming in Java (Coursera)

This course teaches learners (industry professionals and students) the fundamental concepts of parallel programming in the context of Java 8. Parallel programming enables developers to use multicore computers to make their applications run faster by using multiple processors at the same time. By the end of this course, you will learn how to use popular parallel Java frameworks (such as ForkJoin, Stream, and Phaser) to write parallel programs for a wide range of multicore platforms including servers, desktops, or mobile devices, while also learning about their theoretical foundations including computation graphs, ideal parallelism, parallel speedup, Amdahl's Law, data races, and determinism.

Jun 22nd 2026
4 Weeks
Supply Chain Logistics (Coursera) Coursera
Rutgers University

Supply Chain Logistics (Coursera)

Have you ever wondered how goods get delivered to us so quickly as soon as we order them? One word: Logistics! In this introductory Supply Chain Logistics course, I will take you on a journey to this fascinating backbone of global trade. We cover the three major building blocks of logistics networks: transportation, warehousing, and inventory. After completing this course, you will be able to differentiate the advantages and disadvantages of different modes of transportation. You will understand what goes into designing and setting up a warehousing facility. Finally, you will be able to select the options that enable you to develop logistics networks, that minimize costs and deliver top customer service.

Jun 15th 2026
4 Weeks
AWS Data Processing (Coursera) Coursera
Whizlabs

AWS Data Processing (Coursera)

AWS: Data Processing Course is the second course of AWS Certified Data Analytics Specialty Specialization. This course focuses on providing data processing solutions. The entire course is designed to teach learners the concept of EMR and Extract, Transform and Load. This course also put emphasis on ETL services and Data Processing solutions in AWS.

Jul 22nd 2024
3 Weeks
Serverless Data Processing with Dataflow:Foundations Español (Coursera) Coursera
Google Cloud

Serverless Data Processing with Dataflow:Foundations Español (Coursera)

Este curso corresponde a la 1ª parte de una serie de 3 cursos llamada Serverless Data Processing with Dataflow. Para comenzar, en el primer curso haremos un repaso de qué es Apache Beam y cómo se relaciona con Dataflow. Luego, hablaremos sobre la visión de Apache Beam y los beneficios que ofrece su framework de portabilidad. Dicho framework hace posible que un desarrollador pueda usar su lenguaje de programación favorito con su backend de ejecución preferido.

Jun 1st 2026
2 Weeks
Data Wrangling with Python Project (Coursera) Coursera
University of Colorado Boulder

Data Wrangling with Python Project (Coursera)

The "Data Wrangling Project" course provides students with an opportunity to apply the knowledge gained throughout the specialization in a real-life data wrangling project of their interest. Participants will follow the data wrangling pipeline step by step, from identifying data sources to processing and integrating data, to achieve a fine dataset ready for analysis. This course enables students to gain hands-on experience in the data wrangling process and prepares them to handle complex data challenges in real-world scenarios.

Jun 22nd 2026
5-12 Weeks
Oil & Gas Industry Operations and Markets (Coursera) Coursera
Duke University

Oil & Gas Industry Operations and Markets (Coursera)

The oil and gas industry has an enormous impact on all aspects of daily life. Individuals, corporations, and national governments make financial and policy decisions based on the cost, use, and availability of these two natural resources. This two-week course looks at the two most fundamental aspects of the oil and gas industry, its operations and markets, each of which is addressed as a separate module in the course.

Jun 10th 2026
2 Weeks
Data Science Companion (Coursera) Coursera
MathWorks

Data Science Companion (Coursera)

The Data Science Companion provides an introduction to data science. You will gain a quick background in data science and core machine learning concepts, such as regression and classification. You’ll be introduced to the practical knowledge of data processing and visualization using low-code solutions, as well as an overview of the ways to integrate multiple tools effectively to solve data science problems.

Jun 19th 2026
4 Weeks
Serverless Data Processing with Dataflow: Develop Pipelines (Coursera) Coursera
Google Cloud

Serverless Data Processing with Dataflow: Develop Pipelines (Coursera)

In this second installment of the Dataflow course series, we are going to be diving deeper on developing pipelines using the Beam SDK. We start with a review of Apache Beam concepts. Next, we discuss processing streaming data using windows, watermarks and triggers. We then cover options for sources and sinks in your pipelines, schemas to express your structured data, and how to do stateful transformations using State and Timer APIs.

Jun 1st 2026
3 Weeks
The Total Data Quality Framework (Coursera) Coursera
University of Michigan

The Total Data Quality Framework (Coursera)

By the end of this first course in the Total Data Quality specialization, learners will be able to: identify the essential differences between designed and gathered data and summarize the key dimensions of the Total Data Quality (TDQ) Framework; define the three measurement dimensions of the Total Data Quality framework, and describe potential threats to data quality along each of these dimensions for both gathered and designed data; define the three representation dimensions of the Total Data Quality framework, and describe potential threats to data quality along each of these dimensions for both gathered and designed data; and ; describe why data analysis defines an important dimension of the Total Data Quality framework, and summarize potential threats to the overall quality of an analysis plan for designed and/or gathered data.

Jun 22nd 2026
4 Weeks
Data Collection and Processing with Python (Coursera) Coursera
University of Michigan

Data Collection and Processing with Python (Coursera)

This course teaches you to fetch and process data from services on the Internet. It covers Python list comprehensions and provides opportunities to practice extracting from and processing deeply nested data. You'll also learn how to use the Python requests module to interact with REST APIs and what to look for in documentation of those APIs. For the final project, you will construct a “tag recommender” for the flickr photo sharing site.

Jun 15th 2026
3 Weeks