Real-Time Big Data Access using HBase: Boosting Performance (Coursera)

Offered by LearnQuest

In the world of big data, rapid growth in both the volume and the variety of data has created significant challenges. Apache HBase has emerged as a robust solution: a distributed, scalable NoSQL database designed to handle large amounts of data while maintaining high performance.

In this introductory course, you will explore the fundamental concepts of HBase and its significance in handling real-time data processing and analysis.
Topics covered in this course include:
- Fundamentals of HBase
- Schema Design in HBase
- HBase Cluster Setup, Monitoring, and Backup
- HBase Querying and Retrieval Basics
- Advanced HBase Querying Techniques
- Batch Operations and Data Manipulation
- Data Modeling for Real-Time Applications
- Deployment Strategies for Real-Time Applications
- Scalability and Availability in Real-Time Applications
This course includes video lectures, video demonstrations, and hands-on practice in a lab environment. By the end of this course, you will be able to design efficient HBase schemas, set up and optimize HBase clusters, perform data operations, and evaluate when HBase is the right fit for real-time scenarios, taking scalability and deployment strategy into account.

Syllabus

Introduction to HBase
In this module, you will first be introduced to your instructor and the course. Then, we will delve into HBase, exploring NoSQL databases, HDFS, HBase architecture and components, and HBase clusters. We'll compare HBase with other technologies in the big data landscape, and discuss effective data modeling and schema design.

HBase Querying and Data Access
In this module, we will discuss importing data into HBase using Sqoop and explore querying techniques such as Scans, Filters, and Get requests, then practice constructing HBase queries in a lab environment. We will look at advanced querying from a Java application and at optimizing HBase performance through caching and scan optimization. Lastly, we will discuss using batch operations to manipulate data and improve performance.
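To make the difference between Get requests, Scans, Filters, and batch writes concrete, here is a minimal sketch in Python. It is a toy in-memory model, not the HBase client API: the `ToyTable` class, its method names, and the sample row keys are all hypothetical and exist only for illustration. The one HBase property it relies on is that rows are stored sorted by row key, which is what makes range scans cheap.

```python
from bisect import bisect_left, insort


class ToyTable:
    """Hypothetical in-memory stand-in for an HBase table (not the real API)."""

    def __init__(self):
        self._keys = []   # row keys kept in sorted order, as HBase stores them
        self._rows = {}   # row key -> {"family:qualifier": value}

    def put(self, row_key, cells):
        # Insert or update the cells of a single row.
        if row_key not in self._rows:
            insort(self._keys, row_key)
            self._rows[row_key] = {}
        self._rows[row_key].update(cells)

    def put_batch(self, puts):
        # One call for many rows; with a real client, batching like this
        # is what saves network round trips.
        for row_key, cells in puts:
            self.put(row_key, cells)

    def get(self, row_key):
        # Point lookup of one row by its exact key.
        return dict(self._rows.get(row_key, {}))

    def scan(self, start_row="", stop_row=None, row_filter=None):
        # Range read in key order: start_row inclusive, stop_row exclusive.
        i = bisect_left(self._keys, start_row)
        while i < len(self._keys):
            key = self._keys[i]
            if stop_row is not None and key >= stop_row:
                break
            cells = self._rows[key]
            # The filter is applied to each row inside the scanned range.
            if row_filter is None or row_filter(key, cells):
                yield key, dict(cells)
            i += 1


table = ToyTable()
table.put_batch([
    ("user#001", {"info:name": "Ada", "info:city": "London"}),
    ("user#002", {"info:name": "Linus", "info:city": "Helsinki"}),
    ("user#003", {"info:name": "Grace", "info:city": "London"}),
    ("video#001", {"meta:title": "Intro"}),
])

# Get: fetch exactly one row.
one_row = table.get("user#002")

# Scan: "$" is the character right after "#", so this range covers
# every key that starts with "user#" and nothing else.
user_rows = list(table.scan(start_row="user#", stop_row="user$"))

# Scan with a filter: same range, but keep only rows matching a predicate.
londoners = [
    key
    for key, _ in table.scan(
        start_row="user#",
        stop_row="user$",
        row_filter=lambda key, cells: cells.get("info:city") == "London",
    )
]
```

In this sketch a Get touches one key, while a Scan walks a contiguous key range and a filter only narrows what the scan returns. Bounding the scan with start and stop rows, rather than filtering the whole table, is the kind of scan optimization the module description refers to.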

HBase in Real-Time Applications
In this final module, we will discuss HBase data modeling for real-time applications, delving into strategies and use cases. We will also explore deployment strategies for real-time applications, starting with best practices and applying them in a lab environment. Lastly, we will look at HBase scalability and availability in real-time applications, discussing limitations and techniques for analyzing performance, before you practice what you have learned.
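The course description does not say which data modeling strategies are covered, so the following is only an illustration of one widely used pattern for real-time workloads: a reverse-timestamp row key, which makes the newest events sort first so that a scan returns recent data immediately. The key layout, the `event_row_key` helper, and the `sensor-7` device ID are hypothetical.

```python
MAX_LONG = 2**63 - 1  # largest signed 64-bit value, same as Java's Long.MAX_VALUE


def event_row_key(device_id, timestamp_ms):
    # Hypothetical key layout: "<device_id>#<reversed timestamp>".
    reversed_ts = MAX_LONG - timestamp_ms
    # Zero-pad to 19 digits so that string order matches numeric order.
    return f"{device_id}#{reversed_ts:019d}"


# Three events for one device, one minute apart (epoch milliseconds).
keys = sorted([
    event_row_key("sensor-7", 1_700_000_000_000),
    event_row_key("sensor-7", 1_700_000_060_000),
    event_row_key("sensor-7", 1_700_000_120_000),
])

# Recover the original timestamps in sorted-key order: newest comes first.
newest_first = [MAX_LONG - int(key.split("#")[1]) for key in keys]
```

Because rows are stored in key order, a scan starting at `sensor-7#` under this layout would reach the latest reading first. The trade-off is that reading events oldest-first is no longer the natural scan direction.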

Related Courses

Big Data Emerging Technologies (Coursera) Coursera
Yonsei University

Every time you search with Google, every time you use Facebook, Twitter, Instagram, or any other SNS (Social Network Service), and every time you buy from a recommended list of products on Amazon.com, you are using a big data system. In addition, big data technology supports your smartphone, smartwatch, Alexa, Siri, and automobile (if it is a newer model) every day. The top companies in the world are currently using big data technology, and every company needs advanced big data technology support. Simply put, big data technology is not optional for your company; it is a necessity for survival and growth.

Jun 29th 2026
5-12 Weeks
Big Data Integration and Processing (Coursera) Coursera
University of California, San Diego

At the end of the course, you will be able to: Retrieve data from example database and big data management systems; Describe the connections between data management operations and the big data processing patterns needed to utilize them in large-scale analytical applications; Identify when a big data problem needs data integration; Execute simple big data integration and processing on Hadoop and Spark platforms.

Jun 29th 2026
5-12 Weeks
Big Data Modeling and Management Systems (Coursera) Coursera
University of California, San Diego

Once you’ve identified a big data issue to analyze, how do you collect, store and organize your data using Big Data solutions? In this course, you will experience various data genres and management tools appropriate for each. You will be able to describe the reasons behind the evolving plethora of new big data platforms from the perspective of big data management systems and analytical tools.

Jun 22nd 2026
5-12 Weeks
Big Data Science with the BD2K-LINCS Data Coordination and Integration Center (Coursera) Coursera
Icahn School of Medicine at Mount Sinai

In this course we briefly introduce the DCIC and the various Centers that collect data for LINCS. We then cover metadata and how metadata is linked to ontologies. Next, we present data processing and normalization methods to clean and harmonize LINCS data, followed by a discussion of how data is served through RESTful APIs. Most importantly, the course covers computational methods including data clustering, gene-set enrichment analysis, interactive data visualization, and supervised learning. Finally, we introduce crowdsourcing/citizen-science projects where students can work together in teams to extract expression signatures from public databases and then query such collections of signatures against LINCS data to predict small molecules as potential therapeutics.

Jun 29th 2026
5-12 Weeks
Data-driven Decision Making (Coursera) Coursera
PwC

Welcome to Data-driven Decision Making. In this course, you'll get an introduction to Data Analytics and its role in business decisions. You'll learn why data is important and how it has evolved. You'll be introduced to “Big Data” and how it is used. You'll also be introduced to a framework for conducting Data Analysis and what tools and techniques are commonly used. Finally, you'll have a chance to put your knowledge to work in a simulated business setting. This course was created by PricewaterhouseCoopers LLP with an address at 300 Madison Avenue, New York, New York, 10017.

Jun 29th 2026
4 Weeks
Accounting Analytics (Coursera) Coursera
University of Pennsylvania

Accounting Analytics explores how financial statement data and non-financial metrics can be linked to financial performance. In this course, taught by Wharton’s acclaimed accounting professors, you’ll learn how data is used to assess what drives financial performance and to forecast future financial scenarios. While many accounting and financial organizations deliver data, accounting analytics deploys that data to deliver insight, and this course will explore the many areas in which accounting data provides insight into other business areas including consumer behavior predictions, corporate strategy, risk management, optimization, and more.

Jun 29th 2026
4 Weeks
Deploying Machine Learning Models (Coursera) Coursera
University of California, San Diego

In this course we will learn about Recommender Systems (which we will study for the Capstone project), and also look at deployment issues for data products. By the end of this course, you should be able to implement a working recommender system (e.g. to predict ratings, or generate lists of related products), and you should understand the tools and techniques required to deploy such a working system on real-world, large-scale datasets.

Jun 22nd 2026
4 Weeks
Cloud Computing Applications, Part 2: Big Data and Applications in the Cloud (Coursera) Coursera
University of Illinois at Urbana-Champaign

Welcome to the Cloud Computing Applications course, the second part of a two-course series designed to give you a comprehensive view of the world of Cloud Computing and Big Data! In this second course we continue Cloud Computing Applications by exploring how the Cloud opens up analytics on huge volumes of data that are static or streamed at high velocity and represent an enormous variety of information. Cloud applications and data analytics represent a disruptive change in the ways that society is informed by, and uses, information.

Jun 22nd 2026
4 Weeks
Machine Learning With Big Data (Coursera) Coursera
University of California, San Diego

Want to make sense of the volumes of data you have collected? Need to incorporate data-driven decisions into your process? This course provides an overview of machine learning techniques to explore, analyze, and leverage data. You will be introduced to tools and algorithms you can use to create machine learning models that learn from data, and to scale those models up to big data problems.

Jun 22nd 2026
5-12 Weeks
Hadoop Platform and Application Framework (Coursera) Coursera
University of California, San Diego

This course is for novice programmers or business people who'd like to understand the core tools used to wrangle and analyze big data. With no prior experience, you'll have the opportunity to walk through hands-on examples with Hadoop and Spark frameworks, two of the most common in the industry. You will be comfortable explaining the specific components and basic processes of the Hadoop architecture, software stack, and execution environment.

Jun 29th 2026
5-12 Weeks
Foundations of mining non-structured medical data (Coursera) Coursera
EIT Digital

The goal of this course is to understand the foundations of Big Data, the data being generated in the health domain, and how technology can help integrate and exploit that data to extract meaningful information for use across the health domain, from physicians to management and from patients to caregivers.

Jun 29th 2026
5-12 Weeks