Mit Data Science Machine Learning Program Brochure
Mit Data Science Machine Learning Program Brochure
Mit Data Science Machine Learning Program Brochure
MACHINE LEARNING:
MAKING DATA-DRIVEN
DECISIONS
Become a data-driven decision maker with
the 10-week online program delivered
by MIT faculty
ABOUT
MIT IDSS
Education and research at MIT Institute for Data, Systems, and Society (IDSS) are
undertaken with the aim to provide solutions to complex societal challenges by
understanding and analyzing data. The institute is thus committed to the development of
analytical methods that can be applied to diverse areas such as finance, energy systems,
urbanization, social networks, and health.
MIT IDSS embraces the collision and synthesis of ideas and methods from analytical
disciplines including statistics, data science, information theory and inference, systems
and control theory, optimization, economics, human and social behavior, and network
science. These disciplines are relevant both for understanding complex systems and for
presenting design principles and architectures that allow for the systems’ quantification
and management.
MISSION
The mission of MIT IDSS is to advance education and research in state-of-the-art
analytical methods in information and decision systems, statistics and data science, and
the social sciences, and to apply these methods to address complex societal challenges
in a diverse set of areas such as finance, energy systems, urbanization, social networks,
and health.
02
ABOUT
THE PROGRAM
Demand for professionals skilled in data, analytics, and machine learning is exploding.
According to a report by the U.S. Bureau of Labor Statistics, the demand for data science
is set to increase, creating 11.5 million new data-driven jobs by 2026. Data scientists bring
value to organizations across industries because they are able to solve complex
challenges with data and drive important decision-making processes.
The MIT Institute for Data, Systems, and Society (IDSS) understands the power of
uncovering the true value of your data and has created a variety of online courses and
programs to take your data analytics skills to the next level. Whether you are looking to
break into the field, seeking career development opportunities, or simply want to provide
more valuable insights to your company, these offerings will teach you to harness data in
new and innovative ways.
03
PROGRAM
BENEFITS
Learn online from 11 award-winning Get a Certificate of Completion
MIT faculty and instructors by MIT IDSS
PROGRAM
STRUCTURE
The program is 10 weeks long:
04
WHO IS THIS
PROGRAM FOR?
Data Scientists, Data Analysts, and working professionals who wish to extract
actionable insights from large volumes of data
Due to the broad nature of the program, it is suited for both early career
professionals and senior managers, including technical managers, business
intelligence analysts, data science managers, data science enthusiasts, IT
practitioners, management consultants, and business managers
05
PROGRAM
CURRICULUM
The program is 10 weeks long:
MODULE 1 WEEK 1 -2
MODULE 2 WEEK 3
Clustering
What is Clustering?
When to use Clustering
K-means Preliminaries
The K-means algorithm
How to evaluate Clustering
Beyond K-means: What really makes a Cluster?
Beyond K-means: Other notions of distance
Beyond K-means: Data and pre-processing
Beyond K-means: Big data and Nonparametric Bayes
Beyond Clustering
06
Spectral Clustering, Components, and Embeddings
What if we do not have features to describe the data or not all are
meaningful?
Finding the principal components in data and applications
The magic of Eigenvectors I
Clustering in Graphs and Networks
Features from graphs: The magic of Eigenvectors II
Spectral Clustering
Modularity Clustering
Embeddings: New features and their meaning
MODULE 3 WEEK 4
07
The Use of Modern Regression for Causal Inference
Randomized Control Trials
Observational Studies with Confounding
MODULE 4 WEEK 5
MODULE 5 WEEK 6
Deep Learning
What is Image Classification? Introduce ImageNet and show examples
Classification using a single linear threshold (perceptron)
Hierarchical representations
Fitting parameters using back-propagation
Non-convex functions
How interpretable are its features?
Manipulating Deep Nets (Ostrich Example)
Transfer Learning
Other applications I: Speech Recognition
Other applications II: Natural Language Processing
08
LEARNING BREAK WEEK 7
MODULE 6 WEEK 8
Recommendation Systems
Recommendations and Ranking
What does a Recommendation System do?
What is the Recommendation Prediction Problem? And what
data do we have?
Using Population Averages
Using Population Comparisons and Ranking
Collaborative Filtering
Personalization using collaborative filtering using similar users
Personalization using collaborative filtering using similar items
Personalization using collaborative filtering using similar users
and items
Case Study 2: Recommend new songs to the users based on their listening habits
Personalized Recommendations
Personalization using Comparisons, Rankings, and Users-items
Hidden Markov Model / Neural Nets, Bipartite graph, and Graphical Model
Using side-information
20 questions and active learning
Building a system: Algorithmic and system challenges
09
MODULE 7 WEEK 9
Networks
Centrality measures: degree, eigenvector, and page-rank
Closeness and betweenness centrality
Degree distribution, clustering, and small world
Network Models: Erdos-Renyi, configuration model, preferential attachment
Stochastic Models on networks for the spread of viruses or ideas
Influence maximization
Case Study 2: Identifying new genes that cause autism
Graphical Models
Undirected Graphical Models
Ising and Gaussian Models
Learning Graphical Models from data
Directed graphical models
V-structures, “explaining away,” and learning Directed Graphical Models
Inference in Graphical Models: Marginals and message passing
Hidden Markov Model (HMM)
Kalman Filter
10
MODULE 8 WEEK 10
Predictive Analytics
Predictive Modeling for Temporal Data
Prediction Engineering
Case Study 1: NYC Taxi
Feature Engineering
Introduction
Feature Types
Deep Feature Synthesis: Primitives and Algorithms
Deep Feature Synthesis: Stacking
Case Study 2: UK Retail Dataset
Assessment: Graded Case Study - NYC Taxi Trips
11
FACULTY
Devavrat Shah
Professor, EECS and IDSS, MIT
Philippe Rigollet
Professor, Mathematics and IDSS, MIT
Caroline Uhler
Henry L. & Grace Doherty Associate Professor,
EECS and IDSS, MIT
Victor Chernozhukov
Professor, Economics and IDSS, MIT
Stefanie Jegelka
X-Consortium Career Development Associate Professor,
EECS and IDSS, MIT
12
Ankur Moitra
Rockwell International Career Development Associate Professor,
Mathematics and IDSS, MIT
Tamara Broderick
Associate Professor, EECS and IDSS, MIT
David D. Gamarnik
Nanyang Technological University Professor of Operations
Research, Sloan School of Management and IDSS, MIT
Jonathan Kelner
Professor, Mathematics, MIT
Kalyan Veeramachaneni
Principal Research Scientist at the Laboratory for Information
and Decision Systems, MIT
Guy Bresler
Associate Professor, EECS and IDSS, MIT
13
PROGRAM
MENTORS
The program coaches you to work on
hands-on industry-relevant projects by
Data Science and Machine Learning
experts via live and personalized
mentored learning sessions to give
you a practical understanding of core
concepts. A few of the industry
experts engaged with us as program
mentors include:
Roman Mozil
Applied Data Scientist,
Finning, Canada
Matt Nickens
Manager Data Science, PROGRAM
The Walt Disney Studios, US
MANAG E R :
Subhodeep Dey
GUIDE
Data Scientist, Your dedicated Program Manager,
United Health Group, India
provided by Great Learning, will be
your single point of contact for all
academic and non-academic queries
Bhaskarjit Sarmah in the program. They will keep track
Data Scientist,
of your learning journey, give you
BlackRock, India
personalized feedback, and the
required nudges to ensure your
success.
14
CERTIFICATE
The image is for illustrative purposes only. The actual certificate may be subject to change at the discretion of the university.
A P P L I C AT I O N P R O C E S S
Step-1 Step-2 Step-3
Application Form Application Screening Join the Program
Register by completing Your application will be If selected, you will receive
the online application reviewed to determine an offer for the upcoming
form. whether you're eligible cohort. Secure your seat by
for this program. paying the fee.
A P P L I C AT I O N & F E E D E TA I L S
Program Duration: 10 weeks
Fees: USD 1,700
Start Date: October 16, 2021
15
MIT IDSS Data Science and Machine Learning Program, with curriculum
developed and taught by MIT faculty, is delivered in collaboration with
16
READY TO BECOME
A DATA-DRIVEN
DECISION MAKER?
A P P LY N O W