26:["$","$L2f",null,{"data":{"isPreview":true,"seq":7375156,"episode":{"Id":"d6bca193cd95b21877f38dc3a8aafb6adc53fe797d794f55873986660fadcd15","Seq":7375156,"PodId":"c2d6b50707f47c5b2af65a35314bc77065b579cc615d7f559bf53717cbc4938f","PodSeq":24594,"Title":"Advantage-Weighted Regression: Simple and Scalable Off-Policy RL","PodName":"Best AI papers explained","Description":"

This paper introduces Advantage-Weighted Regression (AWR), a simple and scalable off-policy reinforcement learning algorithm built from standard supervised learning techniques. It details AWR's theoretical basis, highlighting its connection to constrained policy optimization and its use of experience replay to handle off-policy data. The authors report performance competitive with existing methods on benchmark tasks and on complex simulated character control, and show that AWR can also learn from purely static datasets. Overall, the work presents AWR as a promising and straightforward approach to reinforcement learning.

\n","Url":"https://podcasters.spotify.com/pod/show/ehwkang/episodes/Advantage-Weighted-Regression-Simple-and-Scalable-Off-Policy-RL-e32tub1","Link":"https://anchor.fm/s/1026675f8/podcast/play/102741793/https%3A%2F%2Fd3ctxlq1ktw2nl.cloudfront.net%2Fstaging%2F2025-4-16%2F400389333-44100-2-f4de4f6d88977.m4a","LinkType":"m4a","PublishTime":"$D2025-05-16T04:47:58.000Z","Img":"https://d3t3ozftmdmh3i.cloudfront.net/staging/podcast_uploaded_nologo/43252366/43252366-1744500070152-e62b760188d8.jpg","EpImg":"https://d3t3ozftmdmh3i.cloudfront.net/staging/podcast_uploaded_nologo/43252366/43252366-1744500070152-e62b760188d8.jpg","Duration":"00:18:38","Language":null,"SampleDuration":null,"IsVBR":false,"Transcribed":false,"Indexed":1,"Deleted":false,"RedirectSeq":null,"Source":null,"Size":null},"prevAndNext":{"prevSeq":7375155,"nextSeq":7375157},"states":{"state":"not-login","extra":{"summary":"Best AI papers explained - Advantage-Weighted Regression: Simple and Scalable Off-Policy RL","previewContent":{"summary":"Best AI papers explained - Advantage-Weighted Regression: Simple and Scalable Off-Policy RL","chapters":[],"keywords":[],"highlights":[],"transcripts":[]}}}}}]