Pandas for Data Science (Coursera)

Offered by Duke University

How can you effectively use Python to clean, sort, and store data? What are the benefits of using the Pandas library for data science? What best practices can data scientists use to work with multiple types of datasets? In the third course of the Data Science Python Foundations Specialization from Duke University, Python users will learn how Pandas, a common Python library for data science, can ease their workflow.

We recommend taking this course after the first two courses of the specialization. However, if you already have prerequisite knowledge of basic algebra, Python programming, and NumPy, you should be able to complete the material in this course.
In the first week, we’ll discuss Python file concepts, including the programming syntax that allows you to read from and write to a file. In the following weeks, we’ll move on to Pandas specifically and the pros and cons of using this library for particular data projects. By the end of this course, you should know when to use Pandas, how to load and clean data in Pandas, and how to use Pandas for data manipulation. This will prepare you for the next step in your journey as a data scientist using Python: creating larger software programs.

What you'll learn

  • How and when to leverage the Pandas library for your data science projects
  • Best practices for cleaning, manipulating, and optimizing data with Pandas

Syllabus

Week 1: Pandas for Data Science
This week, you will learn how to read data from files into your Python program and write that data back to a file. We’ll be working primarily with string-type data in this unit and will give special attention to the way that Python handles strings. Additionally, we’ll go over some basic debugging in Python using exception traces, and you’ll use these skills to create your own Python program that can read from and write to a file.
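
As a rough illustration of the kind of program this week builds toward, here is a minimal sketch that writes strings to a file, reads them back, and handles a missing file. It is not taken from the course materials; the helper names `write_lines` and `read_lines` and the file names are hypothetical.

```python
import os
import tempfile


def write_lines(path, lines):
    """Write each string in `lines` to `path`, one per line."""
    with open(path, "w", encoding="utf-8") as f:
        for line in lines:
            f.write(line + "\n")


def read_lines(path):
    """Return the lines of `path` without trailing newlines.

    Returns an empty list if the file does not exist.
    """
    try:
        with open(path, "r", encoding="utf-8") as f:
            return [line.rstrip("\n") for line in f]
    except FileNotFoundError as exc:
        # The exception message names the missing path, the same
        # information shown on the last line of a traceback.
        print(f"Could not read file: {exc}")
        return []


# Hypothetical file names inside a temporary directory.
tmp_dir = tempfile.mkdtemp()
path = os.path.join(tmp_dir, "notes.txt")

write_lines(path, ["  Pandas ", "numpy"])

# Basic string handling: trim whitespace and normalize capitalization.
cleaned = [s.strip().title() for s in read_lines(path)]

# Reading a file that was never written falls back to an empty list.
missing = read_lines(os.path.join(tmp_dir, "missing.txt"))
```

Using `with` closes the file even if an error occurs partway through, which is one reason it is the usual idiom for file access in Python.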

Week 2: Tabular Data with Pandas
This week, you’ll begin working with Pandas, one of the most commonly used Python libraries in data science. Pandas is predominantly used for working with tabular data. By the end of this week, you’ll be able to identify the hallmarks and quirks of tabular data, describe the benefits and limitations of Pandas, and perform some basic data manipulation techniques in Pandas.
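
For a sense of what basic manipulation of tabular data can look like, the sketch below builds a small DataFrame and applies a few common operations. The column names and numbers are invented for illustration (they are not real statistics), and the choice of operations is an assumption rather than the course's own exercise.

```python
import pandas as pd

# Made-up illustrative data; values are placeholders, not real figures.
df = pd.DataFrame({
    "city": ["Durham", "Raleigh", "Cary"],
    "population": [280000, 470000, 175000],
    "area_sq_mi": [112.0, 147.0, 59.0],
})

# Each column is a Series with its own dtype (a hallmark of tabular data).
dtypes = df.dtypes

# Derive a new column with vectorized arithmetic instead of a Python loop.
df["density"] = df["population"] / df["area_sq_mi"]

# Select rows with a boolean mask.
large = df[df["population"] > 200000]

# Sort rows by a column, highest density first.
by_density = df.sort_values("density", ascending=False)
```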

Week 3: Loading and Cleaning Data
This week, you will learn how to perform basic file operations in Pandas, as well as how to clean up large datasets. You’ll learn to read from and write to common tabular file formats, along with the Pandas-specific intricacies of working with that data. Additionally, you’ll learn best practices for cleaning your data.
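
The following sketch shows one possible load-and-clean pass over a small CSV. The data is invented and read from an in-memory buffer so the example is self-contained; the specific cleaning steps (trimming text, dropping duplicates, coercing dates, filling a missing number with the median) are assumptions about typical practice, not the course's prescribed method.

```python
import io

import pandas as pd

# Hypothetical messy CSV: stray whitespace, a missing age,
# a duplicated row, and an unparseable date.
raw_csv = io.StringIO(
    "name,age,signup_date\n"
    "Ana,34,2021-01-05\n"
    "ben ,,2021-02-11\n"
    "Ana,34,2021-01-05\n"
    "Cleo,29,not a date\n"
)

df = pd.read_csv(raw_csv)

# Normalize text: trim whitespace and fix capitalization.
df["name"] = df["name"].str.strip().str.title()

# Remove exact duplicate rows and renumber the index.
df = df.drop_duplicates().reset_index(drop=True)

# Parse dates; values that cannot be parsed become NaT instead of raising.
df["signup_date"] = pd.to_datetime(df["signup_date"], errors="coerce")

# Fill the missing age with the median of the known ages.
df["age"] = df["age"].fillna(df["age"].median())

# Write the cleaned table back out as CSV (to a buffer here).
out = io.StringIO()
df.to_csv(out, index=False)
```

Whether to fill, drop, or flag missing values depends on the dataset; the median fill here is only one option.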

Week 4: Data Manipulation
This week, you will learn how to combine datasets from different sources. Pandas offers several methods for combining data depending on the outcome you want, and you’ll learn when to use each one. Additionally, we’ll go over computationally efficient ways of querying your data; querying produces results similar to selecting data via subsetting but has a distinct set of advantages.
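
To make the distinction between combining methods concrete, here is a small sketch using `pd.concat` to stack tables, `DataFrame.merge` to join them on a key, and `DataFrame.query` next to an equivalent boolean-mask subset. The tables and column names are hypothetical, and the example only shows that the two selection styles return the same rows; it does not measure performance.

```python
import pandas as pd

# Hypothetical tables from two "sources".
orders_q1 = pd.DataFrame(
    {"order_id": [1, 2], "customer_id": [10, 11], "total": [25.0, 40.0]}
)
orders_q2 = pd.DataFrame(
    {"order_id": [3, 4], "customer_id": [10, 12], "total": [15.0, 60.0]}
)
customers = pd.DataFrame({"customer_id": [10, 11], "name": ["Ana", "Ben"]})

# concat stacks tables that share the same columns.
orders = pd.concat([orders_q1, orders_q2], ignore_index=True)

# An inner merge keeps only orders whose customer_id appears in both tables.
inner = orders.merge(customers, on="customer_id", how="inner")

# A left merge keeps every order; unmatched customers get NaN for name.
left = orders.merge(customers, on="customer_id", how="left")

# Two ways to select the same rows: a query string and a boolean mask.
min_total = 20
by_query = inner.query("total > @min_total")
by_mask = inner[inner["total"] > min_total]
```

In `query`, the `@` prefix refers to a Python variable in the surrounding scope rather than a column.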

Related Courses

Neural Networks and Deep Learning (Coursera)
DeepLearning.AI

If you want to break into cutting-edge AI, this course will help you do so. Deep learning engineers are highly sought after, and mastering deep learning will give you numerous new career opportunities. Deep learning is also a new "superpower" that will let you build AI systems that just weren't possible a few years ago. In this course, you will learn the foundations of deep learning.

Jun 22nd 2026
4 Weeks
Introduction to Machine Learning (Coursera)
Duke University

This course will provide you with a foundational understanding of machine learning models (logistic regression, multilayer perceptrons, convolutional neural networks, natural language processing, etc.) and demonstrate how these models can solve complex problems in a variety of industries, from medical diagnostics to image recognition to text prediction.

Jun 26th 2026
5-12 Weeks
Bayesian Statistics: From Concept to Data Analysis (Coursera)
University of California, Santa Cruz

This course introduces the Bayesian approach to statistics, starting with the concept of probability and moving to the analysis of data. We will learn about the philosophy of the Bayesian approach as well as how to implement it for common types of data. We will compare the Bayesian approach to the more commonly taught frequentist approach and see some of the benefits of the Bayesian approach.

Jun 22nd 2026
4 Weeks
Experimentation for Improvement (Coursera)
McMaster University

We are always using experiments to improve our lives, our community, and our work. Are you doing it efficiently? Or are you (incorrectly) changing one thing at a time and hoping for the best? In this course, you will learn how to plan efficient experiments - testing with many variables. Our goal is to find the best results using only a few experiments. A key part of the course is how to optimize a system.

Jun 22nd 2026
5-12 Weeks
Tools for Data Science (Coursera)
IBM

What are some of the most popular data science tools, how do you use them, and what are their features? In this course, you'll learn about Jupyter Notebooks, RStudio IDE, Apache Zeppelin and Data Science Experience. You will learn about what each tool is used for, what programming languages they can execute, their features and limitations. With the tools hosted in the cloud on Cognitive Class Labs, you will be able to test each tool and follow instructions to run simple code in Python, R or Scala.

Jun 22nd 2026
4 Weeks
AI Workflow: Machine Learning, Visual Recognition and NLP (Coursera)
IBM

This is the fourth course in the IBM AI Enterprise Workflow Certification specialization. You are STRONGLY encouraged to complete these courses in order as they are not individual independent courses, but part of a workflow where each course builds on the previous ones. Course 4 covers the next stage of the workflow, setting up models and their associated data pipelines for a hypothetical streaming media company.

Jun 22nd 2026
2 Weeks
Applied Text Mining in Python (Coursera)
University of Michigan

This course will introduce the learner to text mining and text manipulation basics. The course begins with an understanding of how text is handled by Python, the structure of text both to the machine and to humans, and an overview of the NLTK framework for manipulating text. The second week focuses on common manipulation needs, including regular expressions (searching for text), cleaning text, and preparing text for use by machine learning processes. The third week will apply basic natural language processing methods to text and demonstrate how text classification is accomplished. The final week will explore more advanced methods for detecting the topics in documents and grouping them by similarity (topic modelling).

Jun 22nd 2026
4 Weeks