Data Wrangling with Python Project (Coursera)

Data Wrangling with Python Project (Coursera)

The "Data Wrangling Project" course provides students with an opportunity to apply the knowledge gained throughout the specialization in a real-life data wrangling project of their interest. Participants will follow the data wrangling pipeline step by step, from identifying data sources to processing and integrating data, to achieve a fine dataset ready for analysis. This course enables students to gain hands-on experience in the data wrangling process and prepares them to handle complex data challenges in real-world scenarios.

Class Deals by MOOC List - Click here and see Coursera's Active Discounts, Deals, and Promo Codes.

Throughout the course, students will work on their data wrangling project, applying the knowledge and skills gained in each module to achieve a refined and well-prepared dataset. By the end of the course, participants will be proficient in the data wrangling process and ready to tackle real-world data challenges in diverse domains.
This course is part of the Data Wrangling with Python Specialization.

What you'll learn

  • Initiate and conduct a data wrangling project from raw data to a refined dataset for analysis.
  • Apply data wrangling techniques learned in the specialization to handle real-life data scenarios.
  • Utilize Python libraries and tools effectively for data wrangling tasks. Communicate and present data wrangling results effectively to stakeholders.

Syllabus

Data Wrangling Pipeline
Module 1
In this introductory week, you will gain an understanding of the data wrangling pipeline, which serves as a structured approach to transform raw data into a cleaned and organized dataset for analysis. You will learn the key stages involved in the pipeline, setting the foundation for the rest of the course.

Identify Your Data
Module 2
In this week, you will learn how to identify and define the scope and objectives of your data wrangling project. You will explore various data sources, understand their structure, and assess the suitability of each source for the project.

Data Collection and Integration
Module 3
This week covers the data collection and integration stage of the data wrangling process. You will learn techniques for data collection, validate the collected data, and integrate data from multiple sources.

Data Understanding and Visualization
Module 4
This week focuses on gaining a comprehensive understanding of the dataset through statistical analysis and data visualization. You will learn how to perform descriptive statistics, create informative visualizations, and conduct exploratory data analysis (EDA).

Data Processing and Manipulation
Module 5
In this week, you will delve into essential data processing and manipulation techniques. You will learn how to handle missing values, detect and handle outliers, perform data sampling and dimensionality reduction, apply data scaling and discretization, and explore data cubes and pivot tables.

Go to Class
MOOC List is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Related Courses

Data-driven Decision Making (Coursera) Coursera
PwC

Data-driven Decision Making (Coursera)

Welcome to Data-driven Decision Making. In this course, you'll get an introduction to Data Analytics and its role in business decisions. You'll learn why data is important and how it has evolved. You'll be introduced to “Big Data” and how it is used. You'll also be introduced to a framework for conducting Data Analysis and what tools and techniques are commonly used. Finally, you'll have a chance to put your knowledge to work in a simulated business setting. This course was created by PricewaterhouseCoopers LLP with an address at 300 Madison Avenue, New York, New York, 10017.

Jun 1st 2026
4 Weeks
Fundamentals of Visualization with Tableau (Coursera) Coursera
University of California, Davis

Fundamentals of Visualization with Tableau (Coursera)

In this first course of the specialization, you will discover just what data visualization is, and how we can use it to better see and understand data. Using Tableau, we’ll examine the fundamental concepts of data visualization and explore the Tableau interface, identifying and applying the various tools Tableau has to offer.

Jun 1st 2026
4 Weeks
Data Science Companion (Coursera) Coursera
MathWorks

Data Science Companion (Coursera)

The Data Science Companion provides an introduction to data science. You will gain a quick background in data science and core machine learning concepts, such as regression and classification. You’ll be introduced to the practical knowledge of data processing and visualization using low-code solutions, as well as an overview of the ways to integrate multiple tools effectively to solve data science problems.

Jun 5th 2026
4 Weeks
Python Data Representations (Coursera) Coursera
Rice University

Python Data Representations (Coursera)

This course will continue the introduction to Python programming that started with Python Programming Essentials. We'll learn about different data representations, including strings, lists, and tuples, that form the core of all Python programs. We will also teach you how to access files, which will allow you to store and retrieve data within your programs. These concepts and skills will help you to manipulate data and write more complex Python programs.

Jun 1st 2026
4 Weeks
Dealing With Missing Data (Coursera) Coursera
University of Maryland, College Park

Dealing With Missing Data (Coursera)

This course will cover the steps used in weighting sample surveys, including methods for adjusting for nonresponse and using data external to the survey for calibration. Among the techniques discussed are adjustments using estimated response propensities, poststratification, raking, and general regression estimation. Alternative techniques for imputing values for missing items will be discussed. For both weighting and imputation, the capabilities of different statistical software packages will be covered, including R®, Stata®, and SAS®.

Jun 1st 2026
4 Weeks
Data Analysis Tools (Coursera) Coursera
Wesleyan University

Data Analysis Tools (Coursera)

In this course, you will develop and test hypotheses about your data. You will learn a variety of statistical tests, as well as strategies to know how to apply the appropriate one to your specific data and question. Using your choice of two powerful statistical software packages (SAS or Python), you will explore ANOVA, Chi-Square, and Pearson correlation analysis. This course will guide you through basic statistical principles to give you the tools to answer questions you have developed. Throughout the course, you will share your progress with others to gain valuable feedback and provide insight to other learners about their work.

Jun 1st 2026
4 Weeks
Foundations: Data, Data, Everywhere (Coursera) Coursera
Google

Foundations: Data, Data, Everywhere (Coursera)

This is the first course in the Google Data Analytics Certificate. These courses will equip you with the skills you need to apply to introductory-level data analyst jobs. Organizations of all kinds need data analysts to help them improve their processes, identify opportunities and trends, launch new products, and make thoughtful decisions. In this course, you’ll be introduced to the world of data analytics through hands-on curriculum developed by Google. The material shared covers plenty of key data analytics topics, and it’s designed to give you an overview of what’s to come in the Google Data Analytics Certificate. Current Google data analysts will instruct and provide you with hands-on ways to accomplish common data analyst tasks with the best tools and resources.

Jun 2nd 2026
5-12 Weeks
Process Mining: Data science in Action (Coursera) Coursera
Eindhoven University of Technology

Process Mining: Data science in Action (Coursera)

Process mining is the missing link between model-based process analysis and data-oriented analysis techniques. Through concrete data sets and easy to use software the course provides data science knowledge that can be applied directly to analyze and improve processes in a variety of domains. Data science is the profession of the future, because organizations that are unable to use (big) data in a smart way will not survive. It is not sufficient to focus on data storage and data analysis. The data scientist also needs to relate data to process analysis.

Jun 1st 2026
5-12 Weeks
Machine Learning: Classification (Coursera) Coursera
University of Washington

Machine Learning: Classification (Coursera)

Case Studies: Analyzing Sentiment & Loan Default Prediction. In our case study on analyzing sentiment, you will create models that predict a class (positive/negative sentiment) from input features (text of the reviews, user profile information,...). In our second case study for this course, loan default prediction, you will tackle financial data, and predict when a loan is likely to be risky or safe for the bank.

Jun 1st 2026
5-12 Weeks
Text Mining and Analytics (Coursera) Coursera
University of Illinois at Urbana-Champaign

Text Mining and Analytics (Coursera)

This course will cover the major techniques for mining and analyzing text data to discover interesting patterns, extract useful knowledge, and support decision making, with an emphasis on statistical approaches that can be generally applied to arbitrary text data in any natural language with no or minimum human effort.

Jun 1st 2026
5-12 Weeks
Social Media Data Analytics (Coursera) Coursera
University of Washington

Social Media Data Analytics (Coursera)

Learner Outcomes: After taking this course, you will be able to: utilize various Application Programming Interface (API) services to collect data from different social media sources such as YouTube, Twitter, and Flickr; process the collected data - primarily structured - using methods involving correlation, regression, and classification to derive insights about the sources and people who generated that data; analyze unstructured data - primarily textual comments - for sentiments expressed in them; use different tools for collecting, analyzing, and exploring social media data for research and development purposes.

Jun 1st 2026
4 Weeks
Regression Modeling in Practice (Coursera) Coursera
Wesleyan University

Regression Modeling in Practice (Coursera)

This course focuses on one of the most important tools in your data analysis arsenal: regression analysis. Using either SAS or Python, you will begin with linear regression and then learn how to adapt when two variables do not present a clear linear relationship. You will examine multiple predictors of your outcome and be able to identify confounding variables, which can tell a more compelling story about your results. You will learn the assumptions underlying regression analysis, how to interpret regression coefficients, and how to use regression diagnostic plots and other tools to evaluate the quality of your regression model. Throughout the course, you will share with others the regression models you have developed and the stories they tell you.

Jun 5th 2026
4 Weeks