15.773 | Spring 2024 | Graduate

Hands-On Deep Learning

Refining the Course from Year to Year

Below, Prof. Rama Ramakrishnan describes how he has changed the course since its original conception.

We make a number of small tweaks every year, but perhaps the biggest change is in how we teach the transformer architecture. 

Previously, we used to teach the use of convolutional neural networks as the fundamental deep learning approach for solving computer vision problems, and the transformer for natural language processing problems. Given the increasing use of transformers for computer vision problems, we have switched to using transformers for both natural language processing and computer vision. We also show how to use transformers to solve problems with multi-modal input data.

Course Info

Spring 2024
Instructor Insights
Lecture Notes
Lecture Videos
Problem Set Solutions
Problem Sets
Programming Assignments