11 Jan 2024
1h 25m

RLHF 201 - with Nathan Lambert of AI2 and Interconnects

Podcast cover

Latent Space: The AI Engineer Podcast

Reinforcement Learning from Human Feedback (RLHF) is a technique that combines reinforcement learning with human feedback to train language models. It involves using human preferences to guide the training process, with various challenges, including data collection, reward optimization, and preference aggregation. RLHF has potential applications in language model fine-tuning, decision-making, and dialogue system development.

Outlines

Sign in to continue reading, translating and more.

Open full episode in Podwise