Syllabus

Course Meeting Times

Lectures: 2 sessions / week, 1.5 hours / session

Recitations: 1 sessions / week, 1 hour / session

Prerequisites

15.060 Data, Models, and Decisions or a basic statistics and a basic optimization course.

Course Description

In the last decade, the amount of data available to organizations has reached unprecedented levels. Companies and individuals who can use this data together with analytics give themselves an edge over the competition. In this class, we examine real world examples of how analytics have been used to transform a business or industry. These examples include Moneyball, the Framingham Heart Study, Google, Twitter, IBM Watson, and Netflix among many others. Through these examples and many more, we cover the following analytics methods: linear regression, logistic regression, trees, deep learning, missing data imputation, text analytics, clustering, and optimization. In addition, we present new methods and applications from the research of the instructor: an optimization framework for regression problems, algorithms for missing data, optimal trees for prediction and prescription, algorithms from prediction to prescription, personalized medicine, and patterns of heart attacks among other topics. Students will use the R programming language, a free open statistical computational environment, and LibreOffice (or compatible), an open office suite.

Course Overview

This OCW course site includes content from Dimitris Bertsimas' residential 15.071 course, as well as the 15.071x MOOC offered on MITx authored by Dimitris Bertsimas and Allison O'Hair.

The course is organized by units. Weekly coursework includes:

  • Interactive lecture sequences
  • Recitation
  • Homework

Interactive Lecture Sequences

The lectures are presented in interactive sequences of videos and quick questions. Each sequence includes a succession of short video clips and online questions, arranged in a logical progression. Please take the time to watch each video and complete each question in the sequence they are provided. Answer-check mechanisms provided in these questions are designed to quickly test your understanding of the lecture material. When we work in R or LibreOffice during the lecture videos, we encourage you to follow along.

Recitations

Recitations will cover additional examples of the analytics methods presented in the lectures, and recitations will be used to show how to create models in R or LibreOffice in more depth. Recitation attendance is highly recommended.

Assignments

There will be nine individual homework assignments and a final project that should be done in teams of two. See the Assignments section for more details about the assignments and the final project.

Readings

The readings include chapters from the required book for the class: 

Dimitris Bertsimas, Allison O'Hair and Bill Pulleyblank, The Analytics Edge, Dynamic Ideas, 2016. ISBN: 978-0989910897.

Grading

Grades will be based on the following weighting:

Activities Percentages
Homework Assignments 45%
Final Course Project 45%
Class Participation 10%