Resource-Efficient Deep Learning Execution - Deepak Narayanan | Stanford MLSys #50 | Stanford MLSys Seminars

This podcast episode explores the challenges of training and inference in deep neural networks, specifically focusing on the impact of hardware heterogeneity, parallelization strategies, and resource allocation. It introduces GAVL, a new heterogeneity-aware preemption-based scheduler, and discusses the concept of effective throughput for optimizing resource allocation. The conversation also delves into the cost savings of using preemptible instances in cloud services and the potential for federated learning in edge devices. The implications of model parallelism and the importance of better tooling in distributed training are highlighted. The ultimate goal is to democratize optimization techniques and improve the efficiency of model training.

Outlines

Sign in to continue reading, translating and more.

Open full episode in Podwise

Resource-Efficient Deep Learning Execution - Deepak Narayanan | Stanford MLSys #50

Stanford MLSys Seminars

Challenges and Performance Optimization in Training Deep Neural Networks

The Impact of Heterogeneity and Parallelization on Deep Learning Performance

Performance Trade-offs in Distributed Training and Job Scheduling

The Impact of Heterogeneity on Resource Allocation in Machine Learning Deployments

GAVL: A Heterogeneity-Aware Preemption-Based Scheduler for Optimization

Performance Heterogeneity and Resource Allocation in Machine Learning Training

Optimizing Cloud Resource Usage and Assessing Policies for Preemptible Instances

Impact of Dynamic Decisions on Assessing Policies in Machine Learning

Understanding Inference and Challenges Compared to Training

Differences in Federated Learning and Model Parallelism

Making Model Parallelism Accessible for Small Organizations and Individuals

Better Tooling for Optimization: Making Distributed Training Accessible for Non-Experts

The Journey Towards Democratizing Distributed Training

Resource-Efficient Deep Learning Execution - Deepak Narayanan | Stanford MLSys #50

Stanford MLSys Seminars

00:01Challenges and Performance Optimization in Training Deep Neural Networks

Challenges and Performance Optimization in Training Deep Neural Networks

03:23The Impact of Heterogeneity and Parallelization on Deep Learning Performance

The Impact of Heterogeneity and Parallelization on Deep Learning Performance

09:52Performance Trade-offs in Distributed Training and Job Scheduling

Performance Trade-offs in Distributed Training and Job Scheduling

17:11The Impact of Heterogeneity on Resource Allocation in Machine Learning Deployments

The Impact of Heterogeneity on Resource Allocation in Machine Learning Deployments

20:20GAVL: A Heterogeneity-Aware Preemption-Based Scheduler for Optimization

GAVL: A Heterogeneity-Aware Preemption-Based Scheduler for Optimization

27:08Performance Heterogeneity and Resource Allocation in Machine Learning Training

Performance Heterogeneity and Resource Allocation in Machine Learning Training

31:42Optimizing Cloud Resource Usage and Assessing Policies for Preemptible Instances

Optimizing Cloud Resource Usage and Assessing Policies for Preemptible Instances

34:15Impact of Dynamic Decisions on Assessing Policies in Machine Learning

Impact of Dynamic Decisions on Assessing Policies in Machine Learning

36:50Understanding Inference and Challenges Compared to Training

Understanding Inference and Challenges Compared to Training

39:56Differences in Federated Learning and Model Parallelism

Differences in Federated Learning and Model Parallelism

44:01Making Model Parallelism Accessible for Small Organizations and Individuals

Making Model Parallelism Accessible for Small Organizations and Individuals

49:41Better Tooling for Optimization: Making Distributed Training Accessible for Non-Experts

Better Tooling for Optimization: Making Distributed Training Accessible for Non-Experts

53:37The Journey Towards Democratizing Distributed Training

The Journey Towards Democratizing Distributed Training