Big Data, Genes, and Medicine (Coursera)

This course distills the expert knowledge and skills mastered by professionals in Health Big Data Science and Bioinformatics. You will learn about human biology and chemistry, genetics, and medicine, intertwined with the science of Big Data and the skills needed to harness the avalanche of openly available data that we are only starting to make sense of.

We’ll investigate the different steps required to master Big Data analytics on real datasets, including Next Generation Sequencing data, in a healthcare and biological context, from preparing data for analysis to completing the analysis, interpreting the results, visualizing them, and sharing the results.
Once you master these high-demand skills, you will be well positioned to apply for or move into positions in biomedical data analytics and bioinformatics. Whatever your skill level in biomedical or technical areas, you will gain valuable new or sharpened skills that will make you stand out as a professional and encourage you to dive even deeper into biomedical Big Data. It is my hope that this course will spark your interest in the vast possibilities offered by publicly available Big Data to better understand, prevent, and treat diseases.

Syllabus

WEEK 1
Genes and Data
After this module, you will be able to: 1. Locate and download files for data analysis involving genes and medicine. 2. Open files and preprocess data using the R language. 3. Write R scripts to replace missing values, normalize data, discretize data, and sample data.

WEEK 2
Preparing Datasets for Analysis
After this module, you will be able to: 1. Locate and download files for data analysis involving genes and medicine. 2. Open files and preprocess data using the R language. 3. Write R scripts to replace missing values, normalize data, discretize data, and sample data.
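
The four preprocessing steps named above can be sketched in a few lines. The course itself teaches them in R; the Python below is only an illustrative sketch, and the function names, the mean-imputation rule, the min-max scaling, the equal-width binning, and the toy values are all assumptions rather than course material.

```python
import random
import statistics


def replace_missing(values):
    """Replace None entries with the mean of the observed values."""
    observed = [v for v in values if v is not None]
    mean = statistics.mean(observed)
    return [mean if v is None else v for v in values]


def min_max_normalize(values):
    """Rescale values linearly to the range [0, 1]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # A constant column carries no spread; map it to zeros.
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


def discretize(values, n_bins=3):
    """Assign each value in [0, 1] to one of n_bins equal-width bins."""
    return [min(int(v * n_bins), n_bins - 1) for v in values]


def sample_rows(rows, k, seed=0):
    """Draw k rows without replacement, reproducibly."""
    return random.Random(seed).sample(rows, k)


# Hypothetical expression values for one gene across four samples.
expression = [2.0, None, 4.0, 6.0]
filled = replace_missing(expression)   # [2.0, 4.0, 4.0, 6.0]
scaled = min_max_normalize(filled)     # [0.0, 0.5, 0.5, 1.0]
bins = discretize(scaled)              # [0, 1, 1, 2]
subset = sample_rows(filled, 2)
```

Mean imputation and min-max scaling are only one choice each; median imputation or z-score normalization would slot into the same pipeline.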

WEEK 3
Finding Differentially Expressed Genes
After this module, you will be able to: 1. Select features from high-dimensional datasets. 2. Evaluate the performance of feature selection methods. 3. Write R scripts to select features from datasets involving gene expressions.
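
One common way to select features from expression data is to score each gene with a two-sample statistic and rank by magnitude. The listing does not say which method the course uses, so the Welch t-statistic below is an assumption, as are the gene names, labels, and values. The course works in R; this Python sketch is only illustrative.

```python
import math
import statistics


def welch_t(group_a, group_b):
    """Welch's t-statistic for two independent groups.

    Assumes each group has at least two values and that the groups
    are not both constant (otherwise the standard error is zero).
    """
    mean_a, mean_b = statistics.mean(group_a), statistics.mean(group_b)
    var_a, var_b = statistics.variance(group_a), statistics.variance(group_b)
    se = math.sqrt(var_a / len(group_a) + var_b / len(group_b))
    return (mean_a - mean_b) / se


def rank_genes(expression, labels):
    """Order genes by absolute t-statistic, most differential first."""
    scores = {}
    for gene, values in expression.items():
        cases = [v for v, lab in zip(values, labels) if lab == "disease"]
        controls = [v for v, lab in zip(values, labels) if lab == "healthy"]
        scores[gene] = welch_t(cases, controls)
    return sorted(scores, key=lambda g: abs(scores[g]), reverse=True)


# Hypothetical data: three disease samples followed by three healthy ones.
labels = ["disease"] * 3 + ["healthy"] * 3
expression = {
    "GENE_B": [5.0, 6.0, 4.0, 5.0, 4.0, 6.0],   # same mean in both groups
    "GENE_A": [9.0, 10.0, 11.0, 1.0, 2.0, 3.0],  # clearly shifted
}
ranked = rank_genes(expression, labels)  # ["GENE_A", "GENE_B"]
```

With thousands of genes, a real analysis would also convert the statistics to p-values and correct for multiple testing before calling any gene differentially expressed.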

WEEK 4
Predicting Diseases from Genes
After this module, you will be able to: 1. Build classification and prediction models. 2. Evaluate the performance of classification and prediction methods. 3. Write R scripts to classify and predict diseases from gene expressions.

WEEK 5
Determining Gene Alterations
After this module, you will be able to: 1. List different types of gene alterations. 2. Compare and contrast methods for detecting gene mutations. 3. Compare and contrast methods for detecting methylation. 4. Compare and contrast methods for detecting copy number variations. 5. Quantify genomic alterations. 6. Connect genomic alterations to differential expression of genes. 7. Write programs in R for determining gene alterations and their relationship with gene expression.

WEEK 6
Clustering and Pathway Analysis
After this module, you will be able to: 1. Find clusters in biomedical data involving genes. 2. Analyze and visualize biological pathways. 3. Write R scripts for clustering and for pathway analysis.
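
As a rough picture of what finding clusters in expression data involves, here is a minimal k-means sketch. The listing does not name the clustering algorithm the course uses, so k-means is an assumption, as are the two-dimensional toy profiles and the hand-picked starting centroids. The course works in R; this Python version is only illustrative.

```python
import math


def kmeans(points, centroids, n_iter=10):
    """Lloyd's algorithm: alternate point assignment and centroid update."""
    centroids = [tuple(c) for c in centroids]
    assignments = []
    for _ in range(n_iter):
        # Assignment step: each point joins its nearest centroid.
        assignments = [
            min(range(len(centroids)), key=lambda i: math.dist(p, centroids[i]))
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        new_centroids = []
        for i, old in enumerate(centroids):
            members = [p for p, a in zip(points, assignments) if a == i]
            if members:
                new_centroids.append(
                    tuple(sum(dim) / len(members) for dim in zip(*members))
                )
            else:
                new_centroids.append(old)  # keep an empty cluster in place
        if new_centroids == centroids:
            break  # converged
        centroids = new_centroids
    return assignments, centroids


# Hypothetical expression profiles (two measurements per gene).
profiles = [(1.0, 1.0), (1.0, 2.0), (8.0, 8.0), (9.0, 8.0)]
assignments, centroids = kmeans(profiles, [profiles[0], profiles[2]])
# assignments == [0, 0, 1, 1]
# centroids == [(1.0, 1.5), (8.5, 8.0)]
```

Passing the starting centroids explicitly keeps the run deterministic; in practice they are usually chosen at random and the algorithm is restarted several times.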

Related Courses

Living with Dementia: Impact on Individuals, Caregivers, Communities and Societies (Coursera)
Johns Hopkins University

Health professionals and students, family caregivers, friends of and affected individuals, and others interested in learning about dementia and quality care will benefit from completing the course. Led by Drs. Nancy Hodgson and Laura Gitlin, participants will acquire foundational knowledge in the care of persons with Alzheimer’s Disease and other neurocognitive disorders.

Jun 8th 2026
5-12 Weeks

Infonomics II: Business Information Management and Measurement (Coursera)
University of Illinois at Urbana-Champaign

Even decades into the Information Age, accounting practices still fail to recognize the financial value of information. Moreover, traditional asset management practices fail to recognize information as an asset to be managed with earnest discipline. This has led to a business culture of complacency and to the inability of most organizations to fully leverage available information assets. This second course in the two-part Infonomics series explores how and why to adapt well-honed asset management principles and practices to information, and how to apply accepted and new valuation models to gauge information’s potential and realized economic benefits.

Jun 10th 2026
4 Weeks

Take the Lead on Healthcare Quality Improvement (Coursera)
Case Western Reserve University

In this course you will learn about the importance of quality in healthcare and how you can contribute by implementing a quality improvement (QI) project to improve processes of care and patient outcomes. You will learn about powerful tools to add to your QI ‘toolbox’ during short lectures and reflective exercises. You will apply these tools to the implementation of a QI project in your own practice setting or an area of personal improvement. At the completion of the course, you will have a storyboard that captures your QI project success to share with others.

Jun 8th 2026
5-12 Weeks

Clinical Terminology for International and U.S. Students (Coursera)
University of Pittsburgh

Understanding the clinical terms and abbreviations commonly used in U.S. hospitals is challenging. Adaptation to clinical language can be difficult for U.S. students entering the clinical area and even more so for international students whose primary language is not English. This course is designed to help both groups of students understand the terms and abbreviations commonly encountered during the first three months of clinical work on a U.S. hospital unit.

Jun 8th 2026
5-12 Weeks

The Importance of Listening (Coursera)
Northwestern University

In this second MOOC in the Social Marketing Specialization - "The Importance of Listening" - you will go deep into the Big Data of social and gain a more complete picture of what can be learned from interactions on social sites. You will be amazed at just how much information can be extracted from a single post, picture, or video.

Jun 8th 2026
4 Weeks

Bioinformatic Methods II (Coursera)
University of Toronto

Large-scale biology projects such as the sequencing of the human genome and gene expression surveys using RNA-seq, microarrays and other technologies have created a wealth of data for biologists. However, the challenge facing scientists is analyzing and even accessing these data to extract useful information pertaining to the system being studied. This course focuses on employing existing bioinformatic resources – mainly web-based programs and databases – to access the wealth of data to answer questions relevant to the average biologist, and is highly hands-on.

Jun 8th 2026
5-12 Weeks

Machine Learning: Regression (Coursera)
University of Washington

Case Study - Predicting Housing Prices. In our first case study, predicting house prices, you will create models that predict a continuous value (price) from input features (square footage, number of bedrooms and bathrooms,...). This is just one of the many places where regression can be applied. Other applications range from predicting health outcomes in medicine, stock prices in finance, and power usage in high-performance computing, to analyzing which regulators are important for gene expression.

Jun 8th 2026
5-12 Weeks

Reproducible Research (Coursera)
Johns Hopkins University

This course focuses on the concepts and tools behind reporting modern data analyses in a reproducible manner. Reproducible research is the idea that data analyses, and more generally, scientific claims, are published with their data and software code so that others may verify the findings and build upon them. The need for reproducibility is increasing dramatically as data analyses become more complex, involving larger datasets and more sophisticated computations.

Jun 8th 2026
4 Weeks

Unordered Data Structures (Coursera)
University of Illinois at Urbana-Champaign

The Unordered Data Structures course covers the data structures and algorithms needed to implement hash tables, disjoint sets and graphs. These fundamental data structures are useful for unordered data. For example, a hash table provides immediate access to data indexed by an arbitrary key value, that could be a number (such as a memory address for cached memory), a URL (such as for a web cache) or a dictionary.

Jun 10th 2026
4 Weeks

Machine Learning With Big Data (Coursera)
University of California, San Diego

Want to make sense of the volumes of data you have collected? Need to incorporate data-driven decisions into your process? This course provides an overview of machine learning techniques to explore, analyze, and leverage data. You will be introduced to tools and algorithms you can use to create machine learning models that learn from data, and to scale those models up to big data problems.

Jun 8th 2026
5-12 Weeks

Practical Machine Learning (Coursera)
Johns Hopkins University

Prediction and machine learning are among the most common tasks performed by data scientists and data analysts. This course will cover the basic components of building and applying prediction functions with an emphasis on practical applications. The course will provide basic grounding in concepts such as training and test sets, overfitting, and error rates.

Jun 8th 2026
4 Weeks