EdX

Data Storage and Processing (edX)

Offered by ITMO University, ITMOx,
Data Storage and Processing (edX)

Master the culture of data representation, interpretation and outcomes evaluation. Learn the fundamentals of relational and NoSQL database management systems. Want to learn data processing and interpreting the result you’ve got? This course is for you! Get acquainted with preparing and analyzing large amount of data, as well as data storage fundamentals.

Class Deals by MOOC List - Click here and see EdX's Active Discounts, Deals, and Promo Codes.

This course is an introduction to initial data processing. We will start with data types and sources, methods of data preparation: cleaning, filling in the missing values, data smoothing and normalization. The course will familiarize you with the descriptive statistics and data visualization methods. You will also learn how to analyze time series and find trends.
Get acquainted with the fundamentals of data storage and access: databases types, relational and NoSQL databases, big data initials.
No previous programming knowledge needed.
This course is part of the Data Processing and Analysis Professional Certificate.

What you'll learn

  • Initial data processing (data cleaning and filling in the missing values)
  • Data smoothing and normalization
  • Data visualization
  • Time series analysis
  • Descriptive statistics
  • Data storage and access by means of relational DBMS
  • NoSQL databases and Big data

Syllabus

Week 1:Data preprocessing. Basic concepts of data processing. Stages of data analysis (collection, sorting, transformation, building models and interpretation). Data measurements and scales. Data types and sources. Data preparing.

Week 2:Data processing tools and visualization. Digital spreadsheets. Data visualization goals. Methods and purposes of correct data visualization.

Week 3: Data processing. Descriptive statistics. Data normalization and transformation. Time-series analysis and forecasting. Types of time-series smoothing. Trends, seasonal time series modelling.

Week 4:Relational databases management systems. Introduction to relational DBMS starting from relational data model. SQL statements and queries creation. Database indexes and transactions requirements.

Week 5:NoSQL. Main characteristics of not only SQL databases. Non-structured and semi-structured data and scalability of NoSQL databases. Types of NoSQL databases: column-oriented, key-value store, document store and graph databases.

Go to Class
MOOC List is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Related Courses

Big Data Strategies to Transform Your Business (edX) EdX
Delft University of Technology,DelftX

Big Data Strategies to Transform Your Business (edX)

Make your organization’s business strategy and model, as well as your own career path, future-proof by using big data’s disruptive power. While big data infiltrates all walks of life, most firms have not changed sufficiently to meet the challenges that come with it. In this course, you will learn how to develop a big data strategy, transform your business model and your organization. This course will enable professionals to take their organization and their own career to the next level, regardless of their background and position.

Self Paced
Self-Paced
Introduction to Management Information Systems (MIS): A Survival Guide (edX) EdX
Universidad Carlos III de Madrid - UC3M,UC3Mx

Introduction to Management Information Systems (MIS): A Survival Guide (edX)

Gain the skills and knowledge needed to succeed in an MIS-dominated corporate world. This MIS course will cover supporting tech infrastructures (Cloud, Databases, Big Data), the MIS development/ procurement process, and the main integrated systems, ERPs, such as SAP®, Oracle® or Microsoft Dynamics Navision®, as well as their relationship with Business Process Redesign.

Self Paced
Self-Paced
Biostatistics for Big Data Applications (edX) EdX
University of Texas Medical Branch

Biostatistics for Big Data Applications (edX)

Learn data analysis basics for working with biomedical big data with practical hands-on examples using R. This course provides a broad foundation of statistical terms and concepts as well as an introduction to the R statistical software package. The topics covered are fundamental components of biostatistical methods used in both omics and population health research.

No sessions Available
5-12 Weeks
Wiretaps to Big Data: Privacy and Surveillance in the Age of Interconnection (edX) EdX
Cornell University

Wiretaps to Big Data: Privacy and Surveillance in the Age of Interconnection (edX)

Explore the privacy issues of an interconnected world. How does cellular technology enable massive surveillance? Do users have rights against surveillance? How does surveillance affect how we use cellular and other technologies? How does it affect our democratic institutions? Do you know that the metadata collected by a cellular network speaks volumes about its users? In this course you will explore all of these questions while investigating related issues in WiFi and Internet surveillance.

No sessions available
5-12 Weeks
Programming for Data Science (edX) EdX
University of Adelaide,AdelaideX

Programming for Data Science (edX)

Learn how to apply fundamental programming concepts, computational thinking and data analysis techniques to solve real-world data science problems. There is a rising demand for people with the skills to work with Big Data sets and this course can start you on your journey through our Big Data MicroMasters program towards a recognised credential in this highly competitive area. Using practical activities you will learn how digital technologies work and will develop your coding skills through engaging and collaborative assignments.

Self Paced
Self-Paced
Computational Thinking and Big Data (edX) EdX
University of Adelaide,AdelaideX

Computational Thinking and Big Data (edX)

Learn the core concepts of computational thinking and how to collect, clean and consolidate large-scale datasets. Computational thinking is an invaluable skill that can be used across every industry, as it allows you to formulate a problem and express a solution in such a way that a computer can effectively carry it out.

Self Paced
Self-Paced
Big Data Capstone Project (edX) EdX
University of Adelaide,AdelaideX

Big Data Capstone Project (edX)

Further develop your knowledge of big data by applying the skills you have learned to a real-world data science project. This project will give you the opportunity to deepen your learning by giving you valuable experience in evaluating, selecting and applying relevant data science techniques, principles and theory to a data science problem. This project will see you plan and execute a reasonably substantial project and demonstrate autonomy, initiative and accountability.

Self Paced
Self-Paced
Hacking PostgreSQL: Data Access Methods (edX) EdX
Ural Federal University,UrFUx

Hacking PostgreSQL: Data Access Methods (edX)

Learn the science, engineering practices and hacking techniques of data access – core aspects of information processing in a database. This course is about data storage and data processing technologies with examples from PostgreSQL. It is geared toward database core developers, operation systems developers, system architects, and all those who want to understand databases in more detail.

No sessions available
13-24 Weeks
Enabling Technologies for Data Science and Analytics: The Internet of Things (edX) EdX
Columbia University,ColumbiaX

Enabling Technologies for Data Science and Analytics: The Internet of Things (edX)

Discover the relationship between Big Data and the Internet of Things (IoT). The Internet of Things is rapidly growing. It is predicted that more than 25 billion devices will be connected by 2020. In this data science course, you will learn about the major components of the Internet of Things and how data is acquired from sensors. You will also examine ways of analyzing event data, sentiment analysis, facial recognition software and how data generated from devices can be used to make decisions.

Self Paced
Self-Paced
Análisis de datos: Diseño y Visualización de Tableros (edX) EdX
Delft University of Technology,DelftX

Análisis de datos: Diseño y Visualización de Tableros (edX)

Aprende cómo transformar datos sin procesar, con el uso de tableros en Excel, para apoyar las decisiones de negocio. ¿Luchando con los datos en tu trabajo? ¿Gastando tiempo valioso trabajando en muchas hojas de cálculo en Excel para obtener un resumen de tu negocio? ¿Tienes dificultades para obtener un tablero detallado a partir de montones de datos en tu escritorio? ¿Quieres entender cómo analizar Big Data?

Self Paced
Self-Paced