Moneyball and Beyond (Coursera)

Moneyball and Beyond (Coursera)

The book Moneyball triggered a revolution in the analysis of performance statistics in professional sports, by showing that data analytics could be used to increase team winning percentage. This course shows how to program data using Python to test the claims that lie behind the Moneyball story, and to examine the evolution of Moneyball statistics since the book was published.

Class Deals by MOOC List - Click here and see Coursera's Active Discounts, Deals, and Promo Codes.

The learner is led through the process of calculating baseball performance statistics from publicly available datasets. The course progresses from the analysis of on base percentage and slugging percentage to more advanced measures derived using the run expectancy matrix, such as wins above replacement (WAR). By the end of this course the learner will be able to use these statistics to conduct their own team and player analyses.

Course 2 of 5 in the Sports Performance Analytics Specialization

Syllabus

WEEK 1
In this module we introduce the Moneyball story and explore the method used to test that story. We begin the process of replicating the moneyball test by establishing the relationship between team winning and and two performance statistics - on base percentage (OBP) and slugging percentage (SLG).

WEEK 2
In this module we estimate the relationship between MLB player salaries and their performance statistics, OBP (on base percentage) and SLG (slugging). The results appear to confirm the Moneyball story - OBP was undervalued relative to SLG prior to the publication of Moneyball, while after publication the relative significance is reversed.

WEEK 3
This module updates the analysis of Hakes & Sauer and estimates the rewards to OBP and SLG over the period 1994 -2015. In addition it shows how rewards can be related to individual components of SLG: walks, singles, doubles, triples, and home runs.

WEEK 4
This module introduces the concept of run expectancy, shows how to derive the run expectancy matrix and the calculation of run values based on an MLB dataset of all events in the 2018 season. Run values are calculated by event type (walks, singles, doubles, etc.) and by player.

WEEK 5
This module examines the concept of Wins Above Replacement (WAR) and shows how to calculate WAR based on batting performance. The relationship between play run values team win percentage and player salaries is then explored. Run values are shown to have a high degree of correlation with winning and with salaries. Run values can to a limited extent predict win percentage.

Go to Class
MOOC List is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Related Courses

Machine Learning: Regression (Coursera) Coursera
University of Washington

Machine Learning: Regression (Coursera)

Case Study - Predicting Housing Prices. In our first case study, predicting house prices, you will create models that predict a continuous value (price) from input features (square footage, number of bedrooms and bathrooms,...). This is just one of the many places where regression can be applied. Other applications range from predicting health outcomes in medicine, stock prices in finance, and power usage in high-performance computing, to analyzing which regulators are important for gene expression.

Jun 22nd 2026
5-12 Weeks
Accounting Data Analytics with Python (Coursera) Coursera
University of Illinois at Urbana-Champaign

Accounting Data Analytics with Python (Coursera)

This course focuses on developing Python skills for assembling business data. It will cover some of the same material from Introduction to Accounting Data Analytics and Visualization, but in a more general purpose programming environment (Jupyter Notebook for Python), rather than in Excel and the Visual Basic Editor. These concepts are taught within the context of one or more accounting data domains (e.g., financial statement data from EDGAR, stock data, loan data, point-of-sale data).

Jun 22nd 2026
5-12 Weeks
Introduction to Python Programming (Coursera) Coursera
University of Pennsylvania

Introduction to Python Programming (Coursera)

This course provides an introduction to programming and the Python language. Students are introduced to core programming concepts like data structures, conditionals, loops, variables, and functions. This course includes an overview of the various tools available for writing and running Python, and gets students coding quickly. It also provides hands-on coding exercises using commonly used data structures, writing custom functions, and reading and writing to files.

Jun 22nd 2026
4 Weeks
The Raspberry Pi Platform and Python Programming for the Raspberry Pi (Coursera) Coursera
University of California, Irvine

The Raspberry Pi Platform and Python Programming for the Raspberry Pi (Coursera)

The Raspberry Pi is a small, affordable single-board computer that you will use to design and develop fun and practical IoT devices while learning programming and computer hardware. In addition, you will learn how to set up up the Raspberry Pi environment, get a Linux operating system running, and write and execute some basic Python code on the Raspberry Pi. You will also learn how to use Python-based IDE (integrated development environments) for the Raspberry Pi and how to trace and debug Python code on the device.

Jun 22nd 2026
4 Weeks
Applied Text Mining in Python (Coursera) Coursera
University of Michigan

Applied Text Mining in Python (Coursera)

This course will introduce the learner to text mining and text manipulation basics. The course begins with an understanding of how text is handled by python, the structure of text both to the machine and to humans, and an overview of the nltk framework for manipulating text. The second week focuses on common manipulation needs, including regular expressions (searching for text), cleaning text, and preparing text for use by machine learning processes. The third week will apply basic natural language processing methods to text, and demonstrate how text classification is accomplished. The final week will explore more advanced methods for detecting the topics in documents and grouping them by similarity (topic modelling).

Jun 22nd 2026
4 Weeks
Machine Learning Rapid Prototyping with IBM Watson Studio (Coursera) Coursera
IBM

Machine Learning Rapid Prototyping with IBM Watson Studio (Coursera)

An emerging trend in AI is the availability of technologies in which automation is used to select a best-fit model, perform feature engineering and improve model performance via hyperparameter optimization. This automation will provide rapid-prototyping of models and allow the Data Scientist to focus their efforts on applying domain knowledge to fine-tune models. This course will take the learner through the creation of an end-to-end automated pipeline built by Watson Studio’s AutoAI experiment tool, explaining the underlying technology at work as developed by IBM Research.

Jun 22nd 2026
4 Weeks
Deploying Machine Learning Models (Coursera) Coursera
University of California, San Diego

Deploying Machine Learning Models (Coursera)

In this course we will learn about Recommender Systems (which we will study for the Capstone project), and also look at deployment issues for data products. By the end of this course, you should be able to implement a working recommender system (e.g. to predict ratings, or generate lists of related products), and you should understand the tools and techniques required to deploy such a working system on real-world, large-scale datasets.

Jun 22nd 2026
4 Weeks
Object-Oriented Python: Inheritance and Encapsulation (Coursera) Coursera
Codio

Object-Oriented Python: Inheritance and Encapsulation (Coursera)

Code and run your first python program in minutes without installing anything! This course is designed for learners with limited coding experience, providing a solid foundation of not just python, but core Computer Science topics that can be transferred to other languages. The modules in this course cover inheritance, encapsulation, polymorphism, and other object-related topics. Completion of the prior 3 courses in this specialization is recommended.

Jun 22nd 2026
4 Weeks
Data Analysis and Visualization (Coursera) Coursera
University at Buffalo,The State University of New York

Data Analysis and Visualization (Coursera)

By the end of this course, learners are provided a high-level overview of data analysis and visualization tools, and are prepared to discuss best practices and develop an ensuing action plan that addresses key discoveries. It begins with common hurdles that obstruct adoption of a data-driven culture before introducing data analysis tools (R software, Minitab, MATLAB, and Python). Deeper examination is spent on statistical process control (SPC), which is a method for studying variation over time. The course also addresses do’s and don’ts of presenting data visually, visualization software (Tableau, Excel, Power BI), and creating a data story.

Jun 22nd 2026
4 Weeks