EdX

Python for Data Engineering Project (edX)

Offered by IBM,
Python for Data Engineering Project (edX)

An opportunity to apply your foundational Python skills via a project, using various techniques to collect and work with data. Journey into the realm of becoming a Data Engineer and apply your basic Python knowledge of working with data. You will exercise various techniques in Python to extract data in multiple file formats from different sources, transform it into specific datatypes, and then prepare it for loading it into a database.

Class Deals by MOOC List - Click here and see EdX's Active Discounts, Deals, and Promo Codes.

You will perform these tasks with the help of multiple hands-on labs using Jupyter notebooks and IBM Watson Studio.
On completion of this course, you will have the confidence to employ Python for data engineering tasks such as extracting large data sets from multiple sources through the use of webscraping and APIs, transforming the data and making it ready for gaining valuable business insights.
NOTE: This course is not intended to teach you Python basics and has limited instructional content. Rather, it is intended for you to apply prior Python knowledge.
PRE-REQUISITE: The Python Basics for Data Science course from IBM is a pre-requisite for this project course. Before taking this course, please ensure that you have either completed the Python Basics for Data Science from IBM or have equivalent proficiency in working with Python and data.
This course is part of the Data Engineering Fundamentals Professional Certificate.

What you'll learn
The objective of this course is to give you a solid understanding of what Data Engineering is.
In this course you will apply your Python skills for:

  • Webscraping and data extraction using APIs Transforming data into specific data types ****
  • Logging operations and preparing data for loading
  • Working with Jupyter notebooks and IBM Watson Studio

Syllabus

Module 1: Python Project for Data Engineering

  • ****Collect data using APIs and Webscraping
  • Extract data from different file formats
  • Transform data and prepare for loading
  • Log data operations
  • Share your Jupyter notebook in Watson Studio
  • Submit work and review your peers
Go to Class
MOOC List is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Related Courses

Computing for Data Analysis (edX) EdX
Georgia Institute of Technology,GTx

Computing for Data Analysis (edX)

A hands-on introduction to basic programming principles and practice relevant to modern data analysis, data mining, and machine learning. The modern data analysis pipeline involves collection, preprocessing, storage, analysis, and interactive visualization of data. In the course, you’ll see how computing and mathematics come together.

Aug 19th 2024
13-24 Weeks
Statistical Predictive Modelling and Applications (edX) EdX
University of Edinburgh,EdinburghX

Statistical Predictive Modelling and Applications (edX)

Learn how to apply statistical modelling techniques to real-world business scenarios using Python. In this course, you will learn three predictive modelling techniques - linear and logistic regression, and naive Bayes - and their applications in real-world scenarios. The first half of the course focuses on linear regression. This technique allows you to model a continuous outcome variable using both continuous and categorical predictors. This technique enables you to predict product sales based on several customer variables.

Jan 18th 2022
5-12 Weeks
Python for Data Science (edX) EdX
University of California, San Diego,UC San DiegoX

Python for Data Science (edX)

Learn to use powerful, open-source, Python tools, including Pandas, Git and Matplotlib, to manipulate, analyze, and visualize complex datasets. In the information age, data is all around us. Within this data are answers to compelling questions across many societal domains (politics, business, science, etc.). But if you had access to a large dataset, would you be able to find the answers you seek?

Self Paced
Self-Paced
Introduction to Computational Thinking and Data Science (edX) EdX
MIT,MITx

Introduction to Computational Thinking and Data Science (edX)

This course is an introduction to using computation to understand real-world phenomena. This course will teach you how to use computation to accomplish a variety of goals and provides you with a brief introduction to a variety of topics in computational problem solving. This course is aimed at students with some prior programming experience in Python and a rudimentary knowledge of computational complexity.

Mar 20th 2024
5-12 Weeks
SQL for Data Science (edX) EdX
IBM

SQL for Data Science (edX)

Learn how to use and apply the powerful language of SQL to better communicate and extract data from databases - a must for anyone working in the data science field. Much of the world's data lives in databases. SQL (or Structured Query Language) is a powerful programming language that is used for communicating with and extracting various data types from databases.

Self Paced
Self-Paced
HTML5 Apps and Games (edX) EdX
World Wide Web Consortium - W3C,W3Cx

HTML5 Apps and Games (edX)

Today, developers are increasingly moving from native to HTML5-based apps. Increase your ability to design and deliver innovative services on the Web! Want to learn advanced HTML5 tips and techniques? This is the course for you! Find out more about the powerful Web features that will help you create great content and apps.

Self Paced
Self-Paced
Successfully Evaluating Predictive Modelling (edX) EdX
University of Edinburgh,EdinburghX

Successfully Evaluating Predictive Modelling (edX)

Gain an in-depth understanding of evaluation and sampling approaches for effective predictive modelling using Python. A predictive exercise is not finished when a model is built. This course will equip you with essential skills for understanding performance evaluation metrics, using Python, to determine whether a model is performing adequately.

Oct 26th 2021
5-12 Weeks
Computing in Python I: Fundamentals and Procedural Programming (edX) EdX
Georgia Institute of Technology,GTx

Computing in Python I: Fundamentals and Procedural Programming (edX)

Learn the fundamentals of computing in Python, including variables, operators, and writing and debugging your own programs. This course starts from the beginning, covering the basics of how a computer interprets lines of code; how to write programs, evaluate their output, and revise the code itself; how to work with variables and their changing values; and how to use mathematical, boolean, and relational operators.

Self Paced
Self-Paced
Programa con Python (edX) EdX
The Pontificia Universidad Javeriana,JaverianaX

Programa con Python (edX)

En este MOOC, aprenderás a desarrollar tus primeros programas en Python, un lenguaje de programación que, por su simplicidad y posibilidades, permite la creación de programas sencillos, de forma rápida y ágil. Este es tu primer paso para desarrollar habilidades básicas de programación. Hoy en día, la programación es una habilidad fundamental para el crecimiento y evolución de la tecnología. Python es uno de los lenguajes de programación que se ha popularizado para el desarrollo de software, tanto para las personas expertas como para las personas que inician su camino en la programación, debido a su accesibilidad, facilidad y usabilidad en varios entornos, en comparación con otros lenguajes.

No sessions available
5-12 Weeks
Computing in Python II: Control Structures (edX) EdX
Georgia Institute of Technology,GTx

Computing in Python II: Control Structures (edX)

Learn about control structures, one of the most powerful parts of programming. This course covers conditionals, loops, functions, and error handling, specifically in Python but with broader applicability to other languages as well. Building on your prior knowledge of variables and operators, this course gets into the meat of programming. Organized into five chapters, this course starts by covering the fundamentals of what control structures are and what they do, then moves on to four common control structures in Python.

Self Paced
Self-Paced