Arxiv Preprint - A General Theoretical Paradigm to Understand Learning from Human Preferences | AI Breakdown | Podwise