Big data and Language 1 (Coursera)

In this course, students will explore the characteristics of language through big data. Students will learn how to collect and analyze big data and how to identify linguistic features in the data. A number of approaches to the linguistic analysis of written and spoken texts will be discussed.

Each week of the class consists of lecture videos, approximately 1 hour in length, and a quiz. There will also be a final project that requires students to conduct research on text data and language.

Syllabus

WEEK 1: Introduction to Big Data and Language
WEEK 2: Spoken and Written Data
WEEK 3: Corpus and Register
WEEK 4: Parts of Speech

Related Courses

Introdução à Ciência e Engenharia de Dados (Coursera)
FIA Business School

In this course, you will learn that data has become the main business asset today. With the rise of Big Data and the creation of new technologies, organizations around the world are innovating and discovering new ways to analyze the potential of the data at their disposal, which helps with growth, profitability, the direction of overall operations, and increased customer satisfaction. But for all of this to work properly, and for that potential to be extracted accurately and in a way that is viable for the business, the field of data science was created.

Jun 22nd 2026
4 Weeks
Foundations for Big Data Analysis with SQL (Coursera)
Cloudera

In this course, you'll get a big-picture view of using SQL for big data, starting with an overview of data, database systems, and the common querying language (SQL). Then you'll learn the characteristics of big data and SQL tools for working on big data platforms. You'll also install an exercise environment (virtual machine) to be used through the specialization courses, and you'll have an opportunity to do some initial exploration of databases and tables in that environment.

Jun 22nd 2026
5-12 Weeks
Healthcare Data Quality and Governance (Coursera)
University of California, Davis

Career prospects are bright for those qualified to work with healthcare data or as Health Information Management (HIM) professionals. Perhaps you work in data analytics but are considering a move into healthcare, or you work in healthcare but are considering a transition into a new role. In either case, Healthcare Data Quality and Governance will provide insight into how valuable data assets are protected to maintain data quality. This serves care providers, patients, doctors, clinicians, and those who carry out the business of improving health outcomes.

Jun 22nd 2026
4 Weeks
Graph Analytics for Big Data (Coursera)
University of California, San Diego

Want to understand your data network structure and how it changes under different conditions? Curious to know how to identify closely interacting clusters within a graph? Have you heard of the fast-growing area of graph analytics and want to learn more? This course gives you a broad overview of the field of graph analytics so you can learn new ways to model, store, retrieve and analyze graph-structured data.

Jun 22nd 2026
5-12 Weeks
Teaching Impacts of Technology: Data Collection, Use, and Privacy (Coursera)
University of California, San Diego

In this course you’ll focus on how constant data collection and big data analysis have impacted us, exploring the interplay between using your data and protecting it, as well as thinking about what it could do for you in the future. This will be done through a series of paired teaching sections, exploring a specific “Impact of Computing” in your typical day and the “Technologies and Computing Concepts” that enable that impact, all at a K12-appropriate level.

Jun 17th 2026
4 Weeks
Big Data, Genes, and Medicine (Coursera)
The State University of New York

This course distills for you the expert knowledge and skills mastered by professionals in Health Big Data Science and Bioinformatics. You will learn exciting facts about human biology and chemistry, genetics, and medicine, intertwined with the science of Big Data and the skills to harness the avalanche of data that is openly available at your fingertips and that we are just starting to make sense of.

Jun 22nd 2026
5-12 Weeks
Big Data Analysis with Scala and Spark (Coursera)
École Polytechnique Fédérale de Lausanne

Manipulating big data distributed over a cluster using functional concepts is rampant in industry, and is arguably one of the first widespread industrial uses of functional ideas. This is evidenced by the popularity of MapReduce and Hadoop, and most recently Apache Spark, a fast, in-memory distributed collections framework written in Scala. In this course, we'll see how the data parallel paradigm can be extended to the distributed case, using Spark throughout.

Jun 22nd 2026
4 Weeks
Managing Big Data in Clusters and Cloud Storage (Coursera)
Cloudera

In this course, you'll learn how to manage big datasets, how to load them into clusters and cloud storage, and how to apply structure to the data so that you can run queries on it using distributed SQL engines like Apache Hive and Apache Impala. You’ll learn how to choose the right data types, storage systems, and file formats based on which tools you’ll use and what performance you need.

Jun 22nd 2026
5-12 Weeks
Managing Big Data with MySQL (Coursera)
Duke University

This course is an introduction to how to use relational databases in business analysis. You will learn how relational databases work, and how to use entity-relationship diagrams to display the structure of the data held within them. This knowledge will help you understand how data needs to be collected in business contexts, and help you identify features you want to consider if you are involved in implementing new data collection efforts.

Jun 22nd 2026
5-12 Weeks
Scalable Machine Learning on Big Data using Apache Spark (Coursera)
IBM

This course will empower you with the skills to scale data science and machine learning (ML) tasks on Big Data sets using Apache Spark. Most real-world machine learning work involves very large data sets that go beyond the CPU, memory, and storage limitations of a single computer. Apache Spark is an open-source framework that leverages cluster computing and distributed storage to process extremely large data sets in an efficient and cost-effective manner. An applied knowledge of working with Apache Spark is therefore a great asset and potential differentiator for a Machine Learning engineer.

Jun 22nd 2026
4 Weeks
Think Again I: How to Understand Arguments (Coursera)
Duke University

In this course, you will learn what an argument is. The definition of argument will enable you to identify when speakers are giving arguments and when they are not. Next, you will learn how to break an argument into its essential parts, how to put them in order to reveal their connections, and how to fill in gaps in an argument by adding suppressed premises. By the end of this course, you will be better able to understand and appreciate arguments that you and other people present.

Jun 22nd 2026
4 Weeks