Library
SIGN IN
Prev
Next
Summary
Mindmap
Transcript
Keywords
Highlights
Shownotes
Send
Trending
Ask AI
Library
You
Sign in
Help
Toggle theme
Trending
Ask AI
Library
You
Enjoy Podwise!
Enjoy Podwise!
Sign in to sync your playlist
Playlist 0/50
Prev
Next
YouTube
06 Dec 2024
15m
【人工智能】什么是强化学习中的奖励黑客 | Reward Hacking | OpenAI前安全主管翁荔最新长文 | 奖励函数 | RLHF | 古德哈特定律 | ICRH | 缓释措施
最佳拍档
YouTube
Play
Summary
Mindmap
Transcript
Keywords
Highlights
Shownotes
Sign in to access all AI-generated content
Continue
最佳拍档 - 【人工智能】什么是强化学习中的奖励黑客 | Reward Hacking | OpenAI前安全主管翁荔最新长文 | 奖励函数 | RLHF | 古德哈特定律 | ICRH | 缓释措施
Takeaways
Outlines
Q & A
Preview
How to Get Rich: Every Episode
Naval