Unit 5. Vision and Language

Lecture 5.2: Andrei Barbu - From Language to Vision and Back Again

Description: Using higher level knowledge to improve object detection, language-vision model that simultaneously processes sentences and recognizes image objects and events, performing tasks like image/video retrieval, generating descriptions, and question answering.

Instructor: Andrei Barbu


Course Info

Learning Resource Types

theaters Other Video
theaters Lecture Videos
notes Lecture Notes
group_work Projects
co_present Instructor Insights