Cluster Analysis (edX)

Learn how to conduct a cluster analysis to discover important patterns in student behavior using the popular Weka data mining toolkit. In this course, you will learn the basics of cluster analysis, one of the most popular data mining methods for the discovery of patterns in learning data, and its application in learning analytics.

Cluster analysis enables the identification of common, archetypal patterns of student interactions, which can lead to better understanding of student learning behaviors and provision of personalized feedback and interventions.
The course has a strong hands-on component: you will conduct cluster analyses yourself using the Weka data mining toolkit.
We will cover k-means and hierarchical clustering, two simple yet widely used cluster analysis methods. We will also review published learning analytics studies that adopted cluster analysis and learn how to interpret cluster analysis results.
Finally, we will examine some more advanced techniques and identify practical challenges with cluster analysis, such as selecting the optimal number of clusters and validating the results.

What you'll learn

  • What clustering is and how it is used in learning analytics
  • How to use the Weka toolkit to conduct cluster analysis
  • Popular clustering algorithms (k-means, hierarchical clustering, EM clustering)
  • How to interpret cluster analysis results
  • How to use clustering in learning analytics to solve problems, such as improving student learning experiences and learning outcomes, increasing retention, or providing personalized feedback and support to students
  • How to determine an optimal number of clusters for the analysis

Syllabus

Week 1: Introduction
Lectures:
Introduction to unsupervised machine learning methods
Introduction to clustering
Overview of clustering uses for learning analytics
Labs:
Introduction to the Weka toolkit

Week 2: Overview of k-means and hierarchical clustering methods
Lectures:
K-means clustering theory
K-means full example
Hierarchical clustering theory
Hierarchical clustering full example
Labs:
Conducting k-means clustering using Weka
Conducting hierarchical clustering using Weka
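
As a rough illustration of what the k-means lab automates, here is a minimal Python sketch of the standard k-means (Lloyd's) procedure: assign each point to its nearest centroid, move each centroid to the mean of its points, and repeat until nothing changes. It is not course material and does not use Weka. The student features, the function name `kmeans`, and the hand-picked starting centroids are all assumptions made for the example; in practice the starting centroids are usually chosen at random, and the result can depend on that choice.

```python
import math


def kmeans(points, initial_centroids, max_iter=100):
    """Lloyd's algorithm. Returns (centroids, labels) for a list of numeric tuples."""
    centroids = list(initial_centroids)
    k = len(centroids)
    labels = []
    for _ in range(max_iter):
        # Assignment step: attach each point to its nearest centroid.
        labels = [min(range(k), key=lambda c: math.dist(p, centroids[c])) for p in points]
        # Update step: move each centroid to the mean of its assigned points.
        updated = []
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                updated.append(tuple(sum(dim) / len(members) for dim in zip(*members)))
            else:
                updated.append(centroids[c])  # leave an empty cluster's centroid in place
        if updated == centroids:  # converged: centroids stopped moving
            break
        centroids = updated
    return centroids, labels


# Hypothetical per-student features: (logins per week, forum posts per week).
students = [(1, 0), (2, 1), (1, 1), (9, 8), (10, 7), (8, 9)]

# Assumption for the example: start from one low-activity and one high-activity student.
centroids, labels = kmeans(students, initial_centroids=[students[0], students[3]])
print(labels)  # [0, 0, 0, 1, 1, 1]
print(centroids)
```

Each resulting centroid can be read as an "archetypal" student for its cluster, which is the kind of interpretation the course description refers to.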

Week 3: Practical considerations
Lectures:
How to choose the number of clusters
How to interpret clustering results
Overview of more advanced clustering methods
Labs:
Real-world cluster analysis walkthrough
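
One common heuristic for choosing the number of clusters is the "elbow" method: run k-means for several values of k, record the within-cluster sum of squared distances (WCSS), and look for the k after which adding clusters stops helping much. The Python sketch below illustrates the idea under stated assumptions. It is not taken from the course and may differ from the approach taught there. The toy data, the helper names (`farthest_first`, `kmeans_wcss`, `elbow`), the deterministic farthest-first seeding, and the second-difference rule for locating the bend are all choices made for this example.

```python
import math


def farthest_first(points, k):
    """Deterministic seeding: start at points[0], then repeatedly add the
    point that is farthest from every centroid chosen so far."""
    centroids = [points[0]]
    while len(centroids) < k:
        centroids.append(
            max(points, key=lambda p: min(math.dist(p, c) for c in centroids))
        )
    return centroids


def kmeans_wcss(points, k, max_iter=100):
    """Run k-means and return the within-cluster sum of squared distances."""
    centroids = farthest_first(points, k)
    for _ in range(max_iter):
        # Assignment step: nearest centroid for every point.
        labels = [min(range(k), key=lambda c: math.dist(p, centroids[c])) for p in points]
        # Update step: each centroid becomes the mean of its members.
        updated = []
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                updated.append(tuple(sum(dim) / len(members) for dim in zip(*members)))
            else:
                updated.append(centroids[c])
        if updated == centroids:
            break
        centroids = updated
    return sum(math.dist(p, centroids[lab]) ** 2 for p, lab in zip(points, labels))


def elbow(wcss):
    """Return the k where the WCSS curve bends most sharply, measured by the
    largest second difference. Assumes consecutive integer keys."""
    ks = sorted(wcss)
    return max(
        ks[1:-1],
        key=lambda k: (wcss[k - 1] - wcss[k]) - (wcss[k] - wcss[k + 1]),
    )


# Hypothetical per-student features: (videos watched, quiz attempts).
students = [(1, 1), (2, 1), (1, 2), (8, 1), (9, 2), (9, 1), (5, 8), (4, 9), (5, 9)]

wcss = {k: kmeans_wcss(students, k) for k in range(1, 5)}
for k, value in wcss.items():
    print(f"k={k}: WCSS={value:.2f}")
print("elbow at k =", elbow(wcss))
```

On this toy data the WCSS drops steeply up to k = 3 and barely changes afterwards, so the heuristic selects three clusters. Real learning data rarely shows such a clean bend, which is one reason the choice of k is listed among the practical challenges.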

Prerequisites
We highly recommend that you take the previous course in the series before beginning this course:
Social Network Analysis
This course is intended for those who have a bachelor’s degree and are interested in developing learning and data science skills for employment in education, corporate, nonprofit, and military sectors. Experience with programming and statistics will be beneficial to participants.
