In this lecture, Paul Liang covers common model architectures in deep learning within a unified framework. He begins with logistics, including project proposal submissions and reading assignments on data and learning, specifically the "bitter lesson" and the "grokking" and "double descent" phenomena.

The lecture then lays out a unified paradigm for viewing different architectures, emphasizing the spectrum from domain-specific to general-purpose models. Liang discusses the key factors that make a good model: capturing semantic information, operating at the right granularity, data usage, resource constraints, and usability. He then turns to multimodal-specific methods, recapping modality profiles and the two main steps in deep learning models: learning representations and combining them.

Using sets and point clouds as examples, he explains data invariances and equivariances, and then surveys common architectures through that lens: temporal and sequence models, transformers, spatial models (CNNs and vision transformers), and graph networks. The lecture concludes with a summary of how to model data effectively, emphasizing data collection, cleaning, normalization, visualization, and evaluation, along with the importance of understanding the invariances and equivariances of one's data.
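Since the summary hinges on invariances and equivariances, a tiny numerical sketch may help make the two properties concrete. The snippet below is an illustration, not code from the lecture; names like `set_encoder` and `circ_conv` are hypothetical, and the weights are random placeholders. It checks permutation invariance for a DeepSets-style set encoder and translation equivariance for circular convolution:

```python
import numpy as np

rng = np.random.default_rng(0)

# --- Permutation INVARIANCE: set / point-cloud models (DeepSets-style).
# A per-element encoder followed by a symmetric pooling (sum) yields an
# output that does not change when the set's elements are reordered.
def set_encoder(points, W):
    feats = np.maximum(points @ W, 0.0)  # per-point linear layer + ReLU
    return feats.sum(axis=0)             # symmetric pooling over the set

points = rng.normal(size=(5, 3))         # a "point cloud" of 5 points in R^3
W = rng.normal(size=(3, 8))              # random placeholder weights
shuffled = points[rng.permutation(5)]    # same set, rows reordered
assert np.allclose(set_encoder(points, W), set_encoder(shuffled, W))

# --- Translation EQUIVARIANCE: convolutional (spatial/temporal) models.
# Circular convolution commutes with cyclic shifts: shifting the input
# shifts the output by the same amount.
def circ_conv(signal, kernel):
    n = len(signal)
    return np.real(np.fft.ifft(np.fft.fft(signal) * np.fft.fft(kernel, n)))

signal = rng.normal(size=16)
kernel = rng.normal(size=3)
assert np.allclose(circ_conv(np.roll(signal, 2), kernel),
                   np.roll(circ_conv(signal, kernel), 2))
```

The same properties motivate the architectures the lecture surveys: attention without positional encodings is permutation-equivariant, and graph networks are built to respect node-relabeling symmetry.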