What Makes a Reward Model a Good Teacher? An Optimization Perspective | Best AI papers explained | Podwise