Prediction Models with Sports Data (Coursera)

Prediction Models with Sports Data (Coursera)

In this course the learner will be shown how to generate forecasts of game results in professional sports using Python. The main emphasis of the course is on teaching the method of logistic regression as a way of modeling game results, using data on team expenditures. The learner is taken through the process of modeling past results, and then using the model to forecast the outcome games not yet played.

Class Deals by MOOC List - Click here and see Coursera's Active Discounts, Deals, and Promo Codes.

The course will show the learner how to evaluate the reliability of a model using data on betting odds. The analysis is applied first to the English Premier League, then the NBA and NHL. The course also provides an overview of the relationship between data analytics and gambling, its history and the social issues that arise in relation to sports betting, including the personal risks.
Course 3 of 5 in the Sports Performance Analytics Specialization.

Syllabus

WEEK 1
This module introduces the regression models in dealing with the categorical outcome variables in sport contest (i.e., Win, Draw, Lose). It explains the Linear Probability Model (LPM) in terms of its theoretical foundations, computational applications, and empirical limitations. Then the module introduces and demonstrates the Logistic Regression as a better substitute of LPM for the categorical dependent variables.

WEEK 2
This module explores the relationship between probability and betting markets. It explains the concept of odds, and the relationship between betting odds and probabilities. It then develops a measure of the accuracy of betting odds using sports examples, and assesses the meaning of efficiency in betting markets.

WEEK 3
This module shows how to forecast the outcome of EPL soccer games using an ordered logit model and publicly available information. It assesses the accuracy of these forecasts against the betting odds and shows that they are remarkably accurate.

WEEK 4
This module assesses the efficacy of the EPL forecasting model covered in the previous week by replicating the model in the context of three North American team sports leagues (i.e., NHL, NBA, MLB). Specifically, this module shows how to forecast the outcome of NHL, NBA, MLB regular season games using an ordered logit model and publicly available information. It assesses the accuracy of these forecasts against the betting odds.

WEEK 5
In this module we examine the historical and social consequences of gambling, and the relationship between gambling and statistics. Gambling is explored from the perspective of different ethical and religious systems. Issues of problem gambling are explored and assessed.

Go to Class
MOOC List is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Related Courses

Machine Learning Foundations: A Case Study Approach (Coursera) Coursera
University of Washington

Machine Learning Foundations: A Case Study Approach (Coursera)

Do you have data and wonder what it can tell you? Do you need a deeper understanding of the core ways in which machine learning can improve your business? Do you want to be able to converse with specialists about anything from regression and classification to deep learning and recommender systems? In this course, you will get hands-on experience with machine learning from a series of practical case-studies.

Jun 22nd 2026
5-12 Weeks
International Entertainment and Sports Marketing (Coursera) Coursera
Yonsei University

International Entertainment and Sports Marketing (Coursera)

This course will provide learners with a fundamental understanding of the characteristics and marketing strategies related to two key global industries, sports and entertainment. The growth in both industries have been fueled by their ability to innovate via CCCI, i.e. cross-country and cross-industry expansion. There will be a graded quiz that will consists of 10 questions during the first two weeks (together worth 50% of the grade) and a final quiz that contains 20 questions (worth 50% of the grade) in the third week.

Jun 22nd 2026
3 Weeks
Applied Text Mining in Python (Coursera) Coursera
University of Michigan

Applied Text Mining in Python (Coursera)

This course will introduce the learner to text mining and text manipulation basics. The course begins with an understanding of how text is handled by python, the structure of text both to the machine and to humans, and an overview of the nltk framework for manipulating text. The second week focuses on common manipulation needs, including regular expressions (searching for text), cleaning text, and preparing text for use by machine learning processes. The third week will apply basic natural language processing methods to text, and demonstrate how text classification is accomplished. The final week will explore more advanced methods for detecting the topics in documents and grouping them by similarity (topic modelling).

Jun 22nd 2026
4 Weeks
Python Classes and Inheritance (Coursera) Coursera
University of Michigan

Python Classes and Inheritance (Coursera)

This course introduces classes, instances, and inheritance. You will learn how to use classes to represent data in concise and natural ways. You'll also learn how to override built-in methods and how to create "inherited" classes that reuse functionality. You'll also learn about how to design classes. Finally, you will be introduced to the good programming habit of writing automated tests for their own code.

Jun 22nd 2026
3 Weeks
Basic Data Processing and Visualization (Coursera) Coursera
University of California, San Diego

Basic Data Processing and Visualization (Coursera)

This is the first course in the four-course specialization Python Data Products for Predictive Analytics, introducing the basics of reading and manipulating datasets in Python. In this course, you will learn what a data product is and go through several Python libraries to perform data retrieval, processing, and visualization.

Jun 22nd 2026
5-12 Weeks
Hypothesis Testing with Python and Excel (Coursera) Coursera
Tufts University

Hypothesis Testing with Python and Excel (Coursera)

In today's job market, leaders need to understand the fundamentals of data to be competitive. An essential procedure to understand business and analytics is hypothesis testing. This short course, designed by Tufts University expert faculty, will teach the fundamentals of hypothesis testing of a population mean and a population proportion, using Excel and Python for calculations. You'll also discover the central limit theorem, which is essential for hypothesis testing. To conclude the course, you will apply your newfound skills by creating a plan for an experiment in your own workplace that uses hypothesis testing.

Jun 23rd 2026
1 Week
Python for Data Science, AI & Development (Coursera) Coursera
IBM

Python for Data Science, AI & Development (Coursera)

Kickstart your learning of Python for data science, as well as programming in general, with this beginner-friendly introduction to Python. Python is one of the world’s most popular programming languages, and there has never been greater demand for professionals with the ability to apply Python fundamentals to drive business solutions across industries.

Jun 23rd 2026
5-12 Weeks
Data Management for Clinical Research (Coursera) Coursera
Vanderbilt University

Data Management for Clinical Research (Coursera)

This course presents critical concepts and practical methods to support planning, collection, storage, and dissemination of data in clinical research. Understanding and implementing solid data management principles is critical for any scientific domain. Regardless of your current (or anticipated) role in the research enterprise, a strong working knowledge and skill set in data management principles and practice will increase your productivity and improve your science. Our goal is to use these modules to help you learn and practice this skill set.

Jun 22nd 2026
5-12 Weeks
Python Project: pillow, tesseract, and opencv (Coursera) Coursera
University of Michigan

Python Project: pillow, tesseract, and opencv (Coursera)

This course will walk you through a hands-on project suitable for a portfolio. You will be introduced to third-party APIs and will be shown how to manipulate images using the Python imaging library (pillow), how to apply optical character recognition to images to recognize text (tesseract and py-tesseract), and how to identify faces in images using the popular opencv library. By the end of the course you will have worked with three different libraries available for Python 3 to create a real-world data-analysis project.

Jun 22nd 2026
3 Weeks