Syllabus

Course Meeting Times

Lectures: 2 sessions/week, 1.5 hours/session

Prerequisites

Familiarity with Python at this level
Familiarity with fundamental machine learning concepts such as training/validation/testing, overfitting/underfitting, and regularization
If you have taken 15.071 The Analytics Edge, 15.072 Advanced Analytics Edge (or will be taking one of these concurrently), or if you have other relevant coursework or work experience, you should be fine

Description

Deep learning is the engine behind all the Predictive and Generative AI advances that we see around us today. Starting from around 2010, this single algorithmic strategy has beaten incumbents and broken records in multiple areas: speed recognition, image recognition, natural language processing and so on. Deep learning is considered by many to be a general-purpose technology—like electricity and the Internet—whose impact will be pervasive and profound, and it is well on its way to revolutionizing many fields, from business to the sciences.

This course will unpack deep learning, developing its building blocks from scratch. The emphasis will be on developing a deep, hands-on understanding of how to build models to solve complex problems involving the processing of unstructured inputs (e.g., how do we detect if a driver is falling asleep?) and the generation of unstructured outputs (e.g., how do we summarize the content of a customer-service call transcript?).

You will learn the basics of deep neural networks—layers and activations—and how to set up and train them. You will learn about special-purpose networks that have been invented in the field of computer vision to process images and videos (convolutional networks) and networks that have been invented in the field of natural language processing to process text and sequences (transformers). You will learn how large language models (LLMs) like GPT-4 are built and how to adapt LLMs to specific business applications. You will roll up our sleeves and write Python programs (using the powerful Tensorflow/Keras deep learning software framework) to create deep learning models and train them on real-world datasets.

Throughout the course, we will examine in detail how deep learning is being applied to a range of opportunities and problem areas. The class will place an emphasis on connecting this fascinating new technology to sources of business value.

This course is an approved elective for MBAn and the Business Analytics Certificate.

Grading

Your course grade will be based on two homework assignments, a final project, and class participation:

Class participation	10%
Homework assignments (25% x 2)	50%
Final project	40%

Course Textbook

Deep Learning with Python by Francois Chollet, second edition, October 2021. ISBN: 9781617296864.

Schedule

See the Schedule page for more information on the topics and assignment distribution.

Teams

Students are required to form teams of four that will stay fixed for the duration of the class. Cross-sectional teams are not allowed since teams will be presenting their projects in their respective sections.

We ask that students form teams via Canvas within the first two weeks of class. Use the “People” link on Canvas and navigate to the “Project Groups” tab.

Projects

A major deliverable for the course is a final project. You will submit a 1-page proposal for your final project in the fourth week of class. The proposal will clearly describe the problem (both from a business and technical perspective), the dataset, and your proposed approach(es).

We encourage each team to assemble/curate their own dataset. If that proves to be infeasible given the time constraints of a half-semester class, you are welcome to use publicly available datasets. To that end, the following resources may be helpful:

Teams that do bring their own dataset to the project will get extra credit.

All project submissions must be made available under the MIT license and posted to Github.

Browse Course Material

Course Info

Instructor

Departments

As Taught In

Level

Topics

Learning Resource Types

Hands-on Deep Learning