Analyze Datasets and Train ML Models using AutoML (Coursera)

Offered by DeepLearning.AI, AWS,
Analyze Datasets and Train ML Models using AutoML (Coursera)

In the first course of the Practical Data Science Specialization, you will learn foundational concepts for exploratory data analysis (EDA), automated machine learning (AutoML), and text classification algorithms. With Amazon SageMaker Clarify and Amazon SageMaker Data Wrangler, you will analyze a dataset for statistical bias, transform the dataset into machine-readable features, and select the most important features to train a multi-class text classifier.

Class Deals by MOOC List - Click here and see Coursera's Active Discounts, Deals, and Promo Codes.

You will then perform automated machine learning (AutoML) to automatically train, tune, and deploy the best text-classification algorithm for the given dataset using Amazon SageMaker Autopilot. Next, you will work with Amazon SageMaker BlazingText, a highly optimized and scalable implementation of the popular FastText algorithm, to train a text classifier with very little code.
Practical data science is geared towards handling massive datasets that do not fit in your local hardware and could originate from multiple sources. One of the biggest benefits of developing and running data science projects in the cloud is the agility and elasticity that the cloud offers to scale up and out at a minimum cost.
The Practical Data Science Specialization helps you develop the practical skills to effectively deploy your data science projects and overcome challenges at each step of the ML workflow using Amazon SageMaker. This Specialization is designed for data-focused developers, scientists, and analysts familiar with the Python and SQL programming languages and want to learn how to build, train, and deploy scalable, end-to-end ML pipelines - both automated and human-in-the-loop - in the AWS cloud.
Course 1 of 3 in the Practical Data Science Specialization.

What You Will Learn
Prepare data, detect statistical data biases, and perform feature engineering at scale to train models with pre-built algorithms.

Syllabus

WEEK 1
Explore the Use Case and Analyze the Dataset
Ingest, explore, and visualize a product review data set for multi-class text classification.

WEEK 2
Data Bias and Feature Importance
Determine the most important features in a data set and detect statistical biases.

WEEK 3
Use Automated Machine Learning to train a Text Classifier
Inspect and compare models generated with automated machine learning (AutoML).

WEEK 4
Built-in algorithms
Train a text classifier with BlazingText and deploy the classifier as a real-time inference endpoint to serve predictions.

Go to Class
MOOC List is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Related Courses

SQL for Data Science Capstone Project (Coursera) Coursera
University of California, Davis

SQL for Data Science Capstone Project (Coursera)

Data science is a dynamic and growing career field that demands knowledge and skills-based in SQL to be successful. This course is designed to provide you with a solid foundation in applying SQL skills to analyze data and solve real business problems. Whether you have successfully completed the other courses in the Learn SQL Basics for Data Science Specialization or are taking just this course, this project is your chance to apply the knowledge and skills you have acquired to practice important SQL querying and solve problems with data. You will participate in your own personal or professional journey to create a portfolio-worthy piece from start to finish.

Jun 15th 2026
4 Weeks
Biases and Portfolio Selection (Coursera) Coursera
Rice University

Biases and Portfolio Selection (Coursera)

Investors tend to be their own worst enemies. In this third course, you will learn how to capitalize on understanding behavioral biases and irrational behavior in financial markets. You will start by learning about the various behavioral biases – mistakes that investors make and understand their reasons. You will learn how to recognize your own mistakes as well as others’ and understand how these mistakes can affect investment decisions and financial markets. You will also explore how different preferences and investment horizons impact the optimal asset allocation choice.

Jun 15th 2026
4 Weeks
Unpacking Unconscious Bias in the Workplace (Coursera) Coursera
Coursera Instructor Network

Unpacking Unconscious Bias in the Workplace (Coursera)

“Unpacking Unconscious Bias in the Workplace” is an engaging, short-form course designed for those interested in learning how to not only mitigate personal biases, but also disrupt bias in the workplace. Through self-reflection, learners will begin the course by identifying the key influences that shaped their perspectives and biases, then articulate how those biases impact their behavior in the present.

Jun 15th 2026
2 Weeks
Design and Conduct of Clinical Trials (Coursera) Coursera
Johns Hopkins University

Design and Conduct of Clinical Trials (Coursera)

In this course, you’ll learn how to design and carry out clinical trials. Each design choice has implications for the quality and validity of your results. This course provides you and your team with essential skills to evaluate options, make good design choices, and implement them within your trial. You’ll learn to control for bias, randomize participants, mask treatments and outcomes, identify errors, develop and test hypotheses, and define appropriate outcomes.

Jun 15th 2026
4 Weeks
Perform exploratory data analysis on retail data with Python (Coursera) Coursera
Coursera Project Network

Perform exploratory data analysis on retail data with Python (Coursera)

In this project, you'll serve as a data analyst at an online retail company helping interpret real-world data to help make key business decisions. Your task is to explore and analyze this dataset to gain insights into the store's sales trends, customer behavior, and popular products.

Jun 15th 2026
1 Week
Managing Data Analysis (Coursera) Coursera
Johns Hopkins University

Managing Data Analysis (Coursera)

This one-week course describes the process of analyzing data and how to manage that process. We describe the iterative nature of data analysis and the role of stating a sharp question, exploratory data analysis, inference, formal statistical modeling, interpretation, and communication. In addition, we will describe how to direct analytic activities within a team and to drive the data analysis process towards coherent and useful results.

Jun 15th 2026
1 Week
Capstone Project: Predicting Safety Stock (Coursera) Coursera
LearnQuest

Capstone Project: Predicting Safety Stock (Coursera)

In this course, we'll make predictions on product usage and calculate optimal safety stock storage. We'll start with a time series of shoe sales across multiple stores on three different continents. To begin, we'll look for unique insights and other interesting things we can find in the data by performing groupings and comparing products within each store.

Jun 15th 2026
3 Weeks
Ecosystem Services: a Method for Sustainable Development (Coursera) Coursera
University of Geneva

Ecosystem Services: a Method for Sustainable Development (Coursera)

Ecosystem services are a way of thinking about – and evaluating – the goods and services provided by nature that contribute to the well-being of humans. This MOOC will cover scientific (technical), economic, and socio-political dimensions of the concept through a mix of theory, case-studies, interviews with specialists and a serious-game.

Jun 15th 2026
5-12 Weeks
English for Media Literacy (Coursera) Coursera
University of Pennsylvania

English for Media Literacy (Coursera)

Welcome to English for Media Literacy! This course is designed for non-native English speakers who are interested in learning more about U.S. media literacy. In this course, you will explore different types of mass media; such as, newspapers, magazines, television, and social media. This course will also give you the opportunity to develop a broader understanding of the role media plays in our lives, while building your vocabulary and giving you the language skills needed to analyze what you read and watch.

Jun 8th 2026
5-12 Weeks
Predictive Modeling and Analytics (Coursera) Coursera
University of Colorado Boulder

Predictive Modeling and Analytics (Coursera)

Welcome to the second course in the Data Analytics for Business specialization! This course will introduce you to some of the most widely used predictive modeling techniques and their core principles. By taking this course, you will form a solid foundation of predictive analytics, which refers to tools and techniques for building statistical or machine learning models to make predictions based on data. You will learn how to carry out exploratory data analysis to gain insights and prepare data for predictive modeling, an essential skill valued in the business.

Jun 8th 2026
4 Weeks
Introduction to Probability and Data with R (Coursera) Coursera
Duke University

Introduction to Probability and Data with R (Coursera)

This course introduces you to sampling and exploring data, as well as basic probability theory and Bayes' rule. You will examine various types of sampling methods, and discuss how such methods can impact the scope of inference. A variety of exploratory data analysis techniques will be covered, including numeric summary statistics and basic data visualization.

Jun 8th 2026
5-12 Weeks