Stanford CS230 | Autumn 2025 | Lecture 4: Adversarial Robustness and Generative Models
Stanford Online
In this lecture, Kian Katanforoosh explores two main topics: adversarial robustness and generative modeling. The discussion on adversarial robustness covers attacks on AI models, including prompt injection and data poisoning, and the importance of building proactive defenses. Katanforoosh outlines three waves of adversarial attacks, highlighting how models are increasingly vulnerable due to their reliance on instructions and context. The lecture then transitions to generative models, focusing on GANs and diffusion models, which are used in image and video generation. Katanforoosh explains the differences between discriminative and generative models, emphasizing the latter's ability to learn the underlying distribution of data. The session includes interactive Q&A, addressing concerns about the sensitivity of neural networks to forged images and potential defenses against attacks.
Part 1: Introduction to Attacks
Part 2: Generative Adversarial Networks (GANs)
Part 3: Diffusion Models
Sign in to continue reading, translating and more.
Open full episode in Podwise
