Lessons

Predictive Modeling

This lesson is about predictive modeling. What is predictive modeling? Predictive modeling is the process of modeling historical data to predict future events.

MapReduce

MapReduce is a powerful system that can perform some of the methods we have talked about so far on big data sets using distributed computation.

Classification

In this lesson, we’ll focus on performance evaluation and go into detail on the different performance evaluation metrics.

Ensemble Method

Let’s talk about how we actually develop those models using big data. We’ll start by talking about some of the algorithms used in big data analytics like stochastic gradient descent.

Phenotyping

In this lesson, we’re going to discuss a healthcare application of clustering called phenotyping. Phenotypes are medical concepts such as diseases or conditions.

Clustering

Now we’re going to move on to a different method called clustering. In clustering, we want to discover those labels or categories from raw data.

Spark

We’ll introduce Spark, another big data system that provides better performance by using distributed memory across many machines. We’ll talk about the key concepts behind Spark.

Medical Ontology

For historical reasons, the healthcare and medicine domains have already developed many ontologies for organizing diseases, medical procedures, medications, lab tests, and more.

Graph Analysis

In this lesson, we’ll talk about graph analysis. Graph analysis is a set of methods that are commonly used in search engines and social networks.