26:["$","$L2f",null,{"data":{"isPreview":true,"seq":7375337,"episode":{"Id":"0af22df7cffa62adb6bc381804d6caf26d6fe8755f104acd39bddc1a9816ab4f","Seq":7375337,"PodId":"c2d6b50707f47c5b2af65a35314bc77065b579cc615d7f559bf53717cbc4938f","PodSeq":24594,"Title":"Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback","PodName":"Best AI papers explained","Description":"

The paper surveys limitations of reinforcement learning from human feedback (RLHF).
It highlights challenges in training AI systems with RLHF.
Proposes auditing and disclosure standards for RLHF systems.
Emphasizes a multi-layered approach for safer AI development.
Identifies open questions for further research in RLHF.

\n","Url":"https://podcasters.spotify.com/pod/show/ehwkang/episodes/Open-Problems-and-Fundamental-Limitations-of-Reinforcement-Learning-from-Human-Feedback-e306ca4","Link":"https://anchor.fm/s/1026675f8/podcast/play/99872516/https%3A%2F%2Fd3ctxlq1ktw2nl.cloudfront.net%2Fstaging%2F2025-2-14%2F0aec936a-a9e2-8df0-4e44-d73e958e684f.mp3","LinkType":"mp3","PublishTime":"$D2025-03-14T21:53:07.000Z","Img":"https://d3t3ozftmdmh3i.cloudfront.net/staging/podcast_uploaded_nologo/43252366/43252366-1744500070152-e62b760188d8.jpg","EpImg":"https://d3t3ozftmdmh3i.cloudfront.net/staging/podcast_uploaded_nologo/43252366/43252366-1744500070152-e62b760188d8.jpg","Duration":"00:01:45","Language":null,"SampleDuration":null,"IsVBR":false,"Transcribed":false,"Indexed":1,"Deleted":false,"RedirectSeq":null,"Source":null,"Size":null},"prevAndNext":{"prevSeq":7375336,"nextSeq":7375338},"states":{"state":"not-login","extra":{"summary":"Best AI papers explained - Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback","previewContent":{"summary":"Best AI papers explained - Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback","chapters":[],"keywords":[],"highlights":[],"transcripts":[]}}}}}]