YouTube08 Dec 2025
1h 7m

Stanford CS224R Deep Reinforcement Learning | Spring 2025 | Lecture 2: Imitation Learning

Podcast cover

Stanford Online

The podcast discusses imitation learning, a method for training policies by mimicking expert demonstrations. It covers representing policy distributions using neural networks, emphasizing the importance of expressive distributions to capture the multimodality often present in expert data. The discussion includes techniques like Gaussian mixture models, discretized actions with autoregressive models, and diffusion models. The podcast addresses challenges such as compounding errors and covariate shift, and it introduces strategies for collecting corrective data through online interventions like the DAgger algorithm to improve policy robustness.

Outlines

Part 1: Fundamentals, Supervised Learning

Part 2: Expressive Distributions, Generative Models

Part 3: Error Correction, Online Methods

Sign in to continue reading, translating and more.

Open full episode in Podwise